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A " 1 

Library A— Acquisitions Department— Ordering 



l Start 

— r 

(Libn 1 s • off) 
Date stamp and 
send to Acq»Dept* 






ERIC 



Sort by 

Main Entry 




Stop 




A - 2 

Library A— Acquisitions Department- -Ordering 




© 



A - 3 

Library A— Acquisitions Department— Ordering 




File pink in 
Current Univ. 

PO file 

j 







Send othi 
copies t< 
Purch. D 


er 3 
o Univ* 
ept. 



ERIC 




A - A 



Library A““Acquisitions Department -“Ordering 





A - 5 

Librar y A* -Acquis it ions Department - -Rece iving 



Receiving J 



Bk. 


— ? 


Unpack 













yes 


Note date & dis- 
count price on 
req.cd.orig. & 
refile in ord.filei 






Put req. 
xerox in 


cd. 

bk. 







Shelve 

"Waiting Bills" 






Shelve bk. in 
"Waiting LC cds." 




Send bk. to 
Cat. Dept. 




no 



A - 6 

Library A- -Acquisitions Department— Receiving 





Hd. of Ord.Dept. 
cks • bill against 
pink Univ*PO not- 
ing price & part- 
ial shipments 




A - 7 

Library A— Acquisitions Department— Receiving 



* 





A - 3 

Library A- -Acquis it ions Department-»Receiving 




Send bill to 




Univ, Purch. 





A - 9 

Library B — Acquisitions Department — Ordering 




O 



A - 10 

Library B— Acquisitions Department— Ordering 





0 



0 



A - 11 

. 0 rary B— Acquisitions Department— Ordering 



O 



<■ 



File 5 p 
form in i 
ord. fil 


t. ord. 

curr. 

e 







Date stamp req. 
slip and retn. 
it to req*r. 



© 



Record in 

PO #, ven 
and date 


bk. 

dor 







Send remaining 
5 pts • of Univ. 
PO to Lib.Adm.Off< 



r 



Univ. PO *8 
(5 copies) 




A - 12 

Library B— Library Administration Office— 
—Ordering— 



^ Start 




File in curr # 
Utility Ord. 
File 



Stop 



A • 13 

Library B--Acquisition» Department -•Receiving 






Stop 



A — 14 



Library B— Acquisitions Department— Receiving 





A - 15 

Library B— Acquisitions Department --Receiving 





Stop 




A • 16 

Library B--Acquisitions 



Department--Receiving 



i 






A - 17 




no 




Type confirming 
Util. PO 



/I 



Refile PO's & 

Receiv.Rept. (or 
Dealer List) in 
curr .Util .File 



yes 



File PO's & in- 
voice in curr. 

Util. File 



Verify invoice 
against order 
adding exact price 







no 




Stop 



Library B— Catalog Apartment— Ordering LC Cards 
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A - 19 

Library B— Catalog Department --Ordering LC Cards 

■0 




Stop 



A - 20 

Library B— Catalog Department — Receiving LC Cards 





t 




A - 21 

Library C-- Acquisitions Department— Ordering 




o 

ERIC 




A - 22 

Library C— Acquisitions Department— Ordering 






yes 




Destroy 

ord, 

Rept. to req*r. 




A - 23 

Library C— Acquisitions Department— Ordering 




Stop 



A - 24 



Lib car y C— Bool: Distribution (Catalog Department)— Receiving 




no 



Discard yell, 
slip 




Stop 



A - 25 



Library C— Book Distrlbutlon(cataXog Department) ••Receiving 





A - 26 

Library C— Acquisitions Department— Receiving 






Attach yell. :• 
stiff to invoic 
& send to Bill 
Clk. 



Send bk.,blue 
slip & req. cd. 
to Bk.Dist* 



Discard grn. 
slip 




no waiting bks. 



Ck. invoices 



file 



yes 





Set aside to 
wait invoice 



(2 wks. max.) 




A - 27 

Library C— Acquisitions Department— Receiving 




o 

ERIC 




Library C 



Bk.(blue slip 
sometimes) 




ERIC 



A - 2C 



—Book Distribution (Catalog Department)— Receiving 





Stop 



Stop 



A - 29 

Library C— Book Distribution (Catalog Department)— Receiving 





A - 30 

Library C— Book Distribution (Catalog Department)— Receiving 



LC cds. 
in LC ord. / 
file / 



no 



yr 



X 



LC cds .ord fd. 



yes 






no 






Insert cds. 
in bk. 


1 




Shelve bk. in 
"Ready for Cat." 


© 


yes 


File blue slip 
in LC ord. file 




1 




Insert pink slip 
in bk. 


1 




Shelve bk. in 

"Waiting IXJ*d. 
cds." 



f?\ 



^StopJ 



A - 31 

Library C— Book Distribution(Catalog Department)— Receiving 






Stop 



Library C 



Invoice 
yell* stiffs 




A - 32 

—Acquisitions Department— Receiving 




A • 33 

Library (^“Acquisitions Department ••Accounting 





Stop 



A - 34 

Library C— Acquisitions Department* 







Pd, sched's, 
from Acct, 



yes 



no 



if 



File in complet- 
ed ret'd, pd, 
sched, file 



-Accounting 



Adjust lib, 
acct. 



Stop 



A -35 

Library D— Acquisitions Department— Ordering 




A - 36 

Library D~ Acquisitions Department— Ordering 




A - 37 

Library D-- Acquisitions Department--Orderin& 




Library D— Acquisitions Department— Receiving 



( 

l Start 





A - 39 

Library D— Acquisitions Department— Receiving 




1 

Pull rec] 
ji.d . fox 
proof & 
date on 


l.,yell. 
rm, and 
note 

rfifla 






Pull slip from 
outstd. ords. 
(Dept.) & note 
date 






Insert 

slips & i 
slip in 1 


pulled 

routing 

book 





yes Jfcstab*dl. 

Std. ) 
Ords, J 



y es / Blanket) 
Ords. 



yes_ 





A - 40 

Library D— Acquisitions Department— Receving 




A - 41 

Library D— Acquisitions Department— Receiving 




Shelve book 
in Cat. Dept. 
'Ho Proof" 




A -42 

library D— Acquisitions Department— Receiving Established 

Standing Orders 




A 



A • 4 3 

Library D~ Acquisitions Department-RecsivingoE.tsblished 




A - 4A 

Library D— Acquisitions Department— Receiving Blanket Orders 





A - 45 

Library D-- Acquisitions Department— Receiving Blanket Orders 



Ck, proof file 



Prepare blue 
& yell, forms 



File blue in 
Pub. Cat. 



Shelve on approp. 
shelves in Cat. 
Dept, w/ yell, 
form 





Stop 



Library D— Acquisitions Department— Receiving Blanket Orders 
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A • 47 

Library D— Acquisitions Department— Receiving Blanket Orders 





A • 4C 

Library D- •Acquisitions Department ••Receiving Hew Standing Orders 




Transfer signal 
from index to 
holdings cd. 



Verify bk. J 

against invoice 



Stamp invoice 
'’SO" 



Add PO # to 
invoice if 
lacking 



Record cost & 
invoice date to 
SO holdings cd. 



Record vol. # & 
year (or author - 
title)to SO 
holding^ Cd. 

Adjust encumber- 
ing if more than 
$2,00 diff. 

t 

Hark S(serial) 
on invoice if 
appropriated 



A - 4? 

Library D- -Acquis it ions Department --Receiving New Standing Orders 




Stop 



A - 5C 



Library D— Acquisition# Department— Receiving New Standing Orders 



Prepare blue 
and yell* slips 



File blue in 
Pub * Cat • 



Insert yell, 
in bk* 



Shelve on 
appropriate 
shelves in 
Cat* Dept* 





A - 51 

Library D— Acquisitions Department— Invoice Receiving 





Stop 



1 ■ y > -Acquis it ions 
Department— Accounting 



A • 52 





A - 53 



Library D-- Acquisitions Department -"Accounting 





A • 34 

Library D— Acquisitions Depart mant 




File white in 



Library D- -Acquisitions Department- -OP 




A • 50 

Library D- -Acquis it ions Department 




J QO. 




K)P Report Processing 



Try another 
dealer 




Try another 
dealer 




Stop 



A - 57 

Library E— Acquisitions Department - 



"Ordering 




Start 




At 3:30pm send 
yell., pink and 
buff to Cat. 
Dept. 



Next morning 
mail covdr ltr.& 
vouchers to vendor 



A - 5 C 

Library E— Acquisitions Department- -Ordering 




File re 
in on or 
by title 


q« cd. 
d. file 






Encumber funds 
using grn. ord. 
form 






File grn 
budget c 
file by 


• in 

ontrol 

vendor 



A - 59 

library ^--Catalog Department-- Ordering 







no 


File buff cd* 


in Pub, 


i Cflt • 







File pink & yell 
slips in "Wait- 
ing Books" file 



Retn* cd. & 
slips to 
Acq. Dept. 





Liurary E— Acquisitions Department --Receiving 





Pull & date 
8 tamp grn. 
ord. form 



Sign state 



voucher 




A *• 61 

Lilrury E« -Acquisitions Department- -Receiving 




Note price 
on grn. form 




A - 62 

Library E— Acquisitions Department- 




State X y es 
contingency > — 



-Receiving 



Adjust 

bookkeeping 



Send grn. 
voucher to 
Univ* Acct. 



A - 63 



Library E— Acquisitions Department— Receiving 



Discard grn, 
voucher 



File yell, 
voucher in perm, 
voucher file 




A - 64 

Library E— Catalog Department— LC Card Ordering 




Shelve bk« and 
forms on "curr* 
ggd." shelves by 



File pink & yell, 
Slips yin "Wait- 
ing to Order" .« 
LC cds • . v 



1 8end yell* to 
LC (when 








A - 65 

Library E— Catalog Department— LC Card Ordering 




i 




Stop 



o 

ERIC 



A * 6 .0 

Library E— Catalog Department— LC Card Receiving 






Start 

r 




File slips in 
reorder file 




Library E— Acquisitions department— Out of Print 



Start 



) 




A - 60 

Library E- -Acquis it ions Department --Out of Print 




VCc» 



Type ai d 
send list 



Stop 



A • 69 

Lilrar y ?• -Acquis it ions Department- •Ordering 







70 



A ' 



Library F— Acquisitions Department— Ordering 




o 

ERIC 




Note call 



Keen. req. 
to req'r. 





A - H 

Library F" Acquis it ions Department "Ordering 




a - n 

Library F— Acquisitions Department •-Or daring 








Encumber fund 






Note buff cd, 
"P" for posted 




L7~ 



i File buff cd. 
i in source (Vendor), 

I file j 

• * 

! ' i 



Stop 



A 

Library F* •Acquisitions 



73 






yes 



f 



A - 74 

Library F- -Acquis it ions Department-Receiving 




Shelve bk. & 
slips in new bk. 
sect .for Ref. 
Libn. rev. 



Bks., slips, etc. 
Cat. Dept. 





A - 75 „ J . 

Library F*»Acquisitions Department ••Receiving 






A - 76 

Library F— Acquisitions Department— Receiving 




no 



4 



Send vouc 
c's invoi 
Libn's. C 

signature 


:h. & 2 
Lee to 
)ff. for 

fc 







Send vouch. & 2 
invoices to Acct. 



ip 



ile 1 c. of in- 
voice in Lib. 
remp.Acq. invoice 
file 



yes 



o 

ERIC 



When invoice 
appears on Adm. 
lept. pull invoice 



_ i:z~ 



File invoice 
in master invoice 
file 




Type 

encumbered 
amt. & "carry 
over" no^_ _ 



Stop 
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Current Operations Questionnaire 



This questionnaire was compiled to learn the cost and 
promptness of present technical services operations preparatory 
to comparing them with those of the NELINET center • When all 
completed questionnaires have been returned, the data will be tab- 
ulated. 

Present plans call for preparation of a journal article 
based on the data. 




Dear 



The enclosed questionnaire is designed to help the 
New England State University librarians assemble data on 
the efficiency and costs of current processing operations. 
In particular, data are being sought about those tasks 
for which the NELINET operation will soon offer alterna- 
tives, catalog card and book label production. The use 
of uniform criteria for these data is important not for 
the questionable comparisons of the six libraries that 
become possible, but because it will allow the evalua- 
tions of NELINET services to be compared. 

The following are kinds of figures these data will 
make possible: 

1. For every dollar spent for library materials, 

x cents are spent for technical services salaries. 

2. The average time from request of a recent American 
imprint until it is ready for shelving is: 



ordering days ; 

cataloging days : 

dealer days ; 

total days . 



3. Cost per title for card production. 

4. Cost per volume for lab*?? production. 
Thank you for your cooperation. 

Sincerely yours, 



James E. Agenbroad 
Library Systems Analyst 
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INSTRUCTIONS 

If you do not wish your library to be identified in 
any subsequent publication of these data you may so specify. 

Where information is unavailable or would require 
excessive labor for compilation, please give an estimate and 
indicate it by a 

Please return one completed copy of the questionnaire 
with the used work sheets and list of totals obtained in section 

IX to Inforonics by February 1^ 1968. 

The second copy of the questionnaire is for your 

records . 

Sections I-V 

It is extremely desirable that salary, book fund 
and book collection data have the same basis. Thus, if 
technical services are provided for state colleges whose book 
funds and collection size statistics are not included, salaries 
for these people should be excluded; if a branch library does 
its own ordering and/or cataloging, exclude their book collection 
and book fund data unless salaries of the branch processing 
staff are included with those of the main library; and if 
reclassification constitutes a significant proportion of the 
catalog dept, operation, the salaries spent for it should be 
excluded since reclassification is not part of the book fund or 
collection growth data. Please use data for the year ending 
June 30 , 1967 . 

Sections VI I I- IX 

In these sections data on card and label production 
for reclassification and for state college processing may be 
included or not as is more convenient as long as the same pro- 
cedure is followed for all three kinds of data: supplies, labor 
times, and production quantity. 

Extra work sheets are available. 
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Library 



I. EXPENSES 

1.1. Total library expenses (staff, books, equipment 

and supplies) for 7/66-6/67: 

1.2.1. Total funds for library materials: 

1.2.2. If this included binding, how much for 

binding? 

1.2.3. If this included subscription renewals, how 

much for subscription renewals? 

1.2.4. If this included a fund for new subscriptions, 

hew much for new subscriptions? 

1.3.1. How much for new furniture and equipment? 

Total library , catalog dept. f 

order dept. . 

1.3.2. How much for maintenance and repair of 

furniture and equipment? Total librar y > 

catalog dept. > order dept. • 

1.3.3. How much for leased equipment? Total 

library , catalog dept. > order 

dept. • 

1.4. How much for supplies? Total library 

catalog dept. , order dept. • 

II. PHYSICAL PLANT 
Estimated replacement cost of: 

a. Library with entire contents 

b. All equipment and furnitur e 

c. Catalog dept, equipment and furniture 

d. Order dept, equipment and furniture__ 

e. Card and label production equipment and furniture 



III. PERSONNEL AND SALARIES 



III.l. Total 


Library staff : 




Librarians 


man years 


cost 


Clerical 


man years 


cost 


Student 


hours 


cost 



ERIC 
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I I I. 2. Catalog Dept. Staff 






Librarians 


man years 


cost 




Clerical 


man years 


cost 




Student 


hours 


cost 




I I I. 3. Order Dept. Staff (Please 


include bookkeeping 


even when 


done elsewhere 


in the library) 






Librarians 


man years 


cost 




Clerical 


man years 


cost 




Student 


hours 


cost 




III. 4. Head of 


technical processing and immediate 


staff : 


Librarians 


man years 


cost 




Clerical 


man years 


cost 




Student 


hours 


cost 




I I I. 5. Other technical processing staff (binding, 


serials, 


reclassification, archives, maps, 


govt . docs . , 


processing for state colleges, etc.) 






Librarians 


man years 


cost 




Clerical 


man years 


cost 




Student 


hours 


cost 



What areas does this include? 

III. 6. Administrative staff of library: 



Librarians man years cost 

Clerical man years cost 

Student hours cost 

I I I. 7. Hours per week: librarians clerks 

Weeks per year: librarians clerks 

Paid holidays per year . 

IV. COLLECTION SIZE 

1. Total in library on June 30, 1967 vols; titles 

2. Does this include govt, docs? 

3. Does this include microforms? 

4. Serials vol». titles 

V. COLLECTION GROWTH 7/66-6/67 

1. Added vols; titles; 

2. Cataloged by staff in III. 2. vols; titles; 
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If V.l and 2 differ, why? (Uncataloged documents, gifts, etc.) 



3. 

4. 

5. 

6 . 



Cataloged for branches 

Serials added vols. 

Gifts added .vols. 

Reclassified vols. 



vols. titles. 

titles. 

titles. 

titles. 



VI. 

Dept. 

Stack space 
library 



VI. SPACE 

Square feet devoted to the following: Catalog 

; Order Dept. ! Administrative 



All staff work space 



Total 



VII. OPERATION TIMES 



VII. 1. Select randomly from books going from 



processing to the stacks 30 recent American trade titles and 
supply as much of the following data for each as possible. 



A. Date B. Date 
requested order 
by faculty sent out 



C. Date D. Date E. Date 
book rec*d sent to sent to 
or paid for cat. dept, stacks 



1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 



A 



B 
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C 



D 



E 



17. 

18. 

19. 

20 . 

21 . 

22 . 

23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 



71 I. 2. Select randomly from cards just typed and 
about to be arranged for filing into the catalog 30 main entries 
for recent American trade titles and supply as much of the fol- 
lowing data as possible: 

A. Date B. Date C. Date D. Date E. Date of 

requested order book rec*d sent to this test 

by faculty sent out or paid for cat. dept. 



1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14, 
o 

ERIC 
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A B C D E 

14. 

15. _____ 



17. 

18. 

19. 

20 . 

21 . 

22 . 

23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 



VIII. SUPPLY COSTS 

VIII. List actual or estimated supply costs for card 
and label production during 7/66-6/67. Please add items omitted 
from the list. Where supplies are also used for other purposes 
(typewriter maintenance for instance) estimate the proportion 
used for card and label production. 

1. Card stock 

2. L.C. cards 

3. L.C. proof slips - 

4. Pressure sensitive labels , 

* . Polaroid film 

6. Xerox supplies 

7. Xerox maintenance 

8. Flexowriter maintenance 

9. Flexowriter supplies 

10. Typewriter mfH ntonanoo „ 

11. Selin supplies 




12. Xeroj; rental 

13. Snopake 

14. Other, specify. 

15. . 

16. 



IX. LABOR TIME, COSTS, AND PRODUCTION 
IX. 1.1. Procedure: 1. For one ’’normal” week, have all 
staff involved in production of cards and labels fill out the 
daily work sheets indicating the minutes they spend at each tas 
It should be stressed that all the time devoted to producing 
cards and labels is to be included but that the time spent using 
the final product (suoh as iiling catalog cards and pasting 

labels!) should be excluded. 

IX. 1.2. At the end of the week arrange the sheets 

according to the salary of the workers. 

IX. 1.3. Add together the minutes spent at each 

task by all individuals earning the same salary and working the 
same hours/week/ year . 

IX. 1.4. List these totals on a separate sheet in the 



format : 

TASK 


MINUTES 


SALARY 


PER 


A1 


360 


1.75 


hr. 


A1 


2400 


3,000 


yr. 


IX. 2. 


In a locally convenient 


manner, obtain data 



on card sets produced and volumes labelled during the above 



week: sets; vois. 

X. L.C. COPY 

Proportion of acquisitions for which L.C. cataloging 
(L.C. cards, proof slips, title II cards, PW, or LC entries in 
NUC) is not used (not available or too slow) . 

— If L.C. cards are ordered, how long until receipt of the 

bulk of an order? __ - 
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XI . OVERHEAD 

XI. 1. If any of the following are excluded from the 
library expenses in I. above but are incurred by operation of 
the library , please list or estimate below the amount spent 
during 7/66-6/67. (For instance , if the president, vice president 
and treasurer devote one half their time to fund raising and 
four per cent of the funds raised are applied to library oper- 
ation, then two per cent of their salaries are spent to acquire 
operating funds for the library.) 

XI. 2. If a figure used by the university to compute 
the overhead costs on applications for federal research grants 
(at one university an additional 40% of the salaries of the 
researchers) is an accurate indicator of the library overhead 
(covers all the items listed below) write it here and explain its 
use: 



If such a figure exists, but does not reflect library 
overhead, write it here; explain its use and then give data for 
items not covered below. 

XI. 3. Fringe benefits to library staff (if overall 
fringe benefits per person or per faculty and per clerical 
figures are available just supply them.) 

a. University retirement contribution 

b. University health insurance contribution __ 

c. University life insurance contribution 

d. Travel expenses paid to staff 

e. Relocation grants to new staff 

f. Tuition grants to staff and their families 

g. Staff use of University subsidized clinic 

h. Staff use of University subsidized housing 

i. Other, please specify 




XI. 4. Utilities: 

a. Heat 

b. Air conditioning 

c. Electricity 

d. Water 

e. Telephone 

f. Postage 

g. Other, specify 

XI 5. Services: 

a. Messenger 

b. Payroll and Billing 

c. Accounting and Auditing 

d. Janitorial 

e. Architectural and Interior Decorating 

f. Equipment purchase handling 

g. Printing 

h. Grounds 

i. Repairs to building and equipment 

j. Data processing, programming 
and computer time 

k. Personnel services such as typing tests 
for clerks 

l. Fund raising by alumni office and 
university administration 

m. Proportionate share of general university 
overhead such as news bureau, fire 
insurance , etc . 

n. Other, specify — 



XII. QUESTIONNAIRE 

XII. 1. Estimated time and cost of completing 

questionnaire. — 

XI 1.2. Comments on this questionnaire. 
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TECHNICAL MEMORANDUM NO. 261 



TO; 


NEBHE File 


FROM: 


William R, Nugent 


SUBJECT; 


An Algorithm for Splitting the LC Call Number 


DATE; 


February 3, 1968 




1 . INTRODUCTION 



Man has been able to successfully split the atom and 
the infinitive, but the LC Call Number still defies simple solu- 
tions. A new algorithm is presented here. It works on Ann 
Curran’s list of 39 examples (7M 233-9/26/67) and on LC’s list 
of 6. These examples and the resulis of applying the algorithm 
to them are contained in pages C8 - C13 of this memo. The algorithm 
is a multiple pass process that first identifies kinds of elements 
in the linear string form of the call number, and then formats 
these according to their identity and the maximum column length. 
Sequences of alpha characters, numeric characters, and punctuation 
are used as a guide to identification and division. Beginning and 
end of line punctuation i» generally dropped where item division 
is its only function, because end-of-line is itself a strong divi- 
sion, such punctuation is therefore redundant in column form, and 
conservation of space is achieved. 

The Curran and LC examples are shown to illustrate the 
algorithm; interested staff members are invited to try it on other 
examples and determine what circumstances, if any, would cause 
the algorithm to incorrectly split. 

2. ALGORITHM 

The linear string of input symbols is taken in strict 
sequence from left to right and up to 8 classes of items are 
identified and extracted. The extracted items are then examined 
and divided where necessary to maintain a 6 character column. 

The items are then further scanned to concatenate certain short 
items occupying separate lines. The process, therefore, has 4 
parts : 



1. identification 

2. extraction 

3. division 

4. concatenation 
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2.1 Identification and Extraction 



Eight classes of items are identified , which wo will 
refer to by their class numbers: 

1. Alphabetic portion of class number 

2. Initial numeric portion of class number 

3. Decimal fraction portion of class number 

4. Book number (initial alpha numeric portion) 

5. Year designation 

6. Alpha numeric designator (part numbers, etc.) 

7. Other 

8. Initial alphabetic designator (e.g., microfiche, 
incunabula, .etc.) 

Identification is performed on the basis of matching 
the first and successive characters of the input with a specific 
form of symbol string. The input string is identified successively 
left to right, and as one identification is made, the portion 
identified is extracted and tagged, and the identifier then works 
on the first and successive characters remaining in the input 
string. Tests for certain classes may be repeated twice, but 
they are not returned to later. Figure 1 shows the flow chart of 
the test sequence. The particular tests are described here sub- 
sequently. 



The identification method seeks a predictable order of 
items, and when it correctly identifies these items, as it generally 
does, the resulting item partitioning is itself usually sufficient 
for determining column content. When identification is correct 
and the item is overly long, the identification helps in determining 
its final format. When a first unruly item is detected, the whole 
call number is assumed unruly and this and successive items are 
placed in class 7 ("other'*), and further partitioning may be neces- 
sary. 



The following definitions are used, defined via Backus 
conventions. Non-primitives are enclosed in triangular brackets, 
the vertical bar means "or" and the overbar means "not". 

<character> : : = <alpha> | <numeri£> |<jpunctuaticn> | 

<spac£> 

<alpha^> : *<^> : * A |b |c | • • • |z la |b |c |. • • |z| 

<£umeric^> : = <5v > • m 0 1 1 1 2 1 . . . . 1 9| 

<space> : - <□> 

<$>unctuatioq>: «<£>: « |<slash>|<comma>|<period>|<hyphen>| 
<other code> 

<£nd of call number>: ®<e >:= as determined by format and 
context 
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<slasfi> : ** / 

<comma> : - , 

<perio<|> : * . 

<other codq> : - <p^|< 3 ^|/| 7[ • 



To define the identification criteria and specify the 
symbols to be extracted, we will require sequence operators. We 
use conventions similar to Kleene’s, where each symbol is a 
defining sequence, or parenthetical expression in a defining 
sequence, represents a descrete and necessary occurrence. The 
following conventions apply: 

d /3 i m initial alpha immediately followed by beta 
I : - Identical character (universal) 



d I S 
cj/S <d 



ol/3 o I 



♦ 



oM 
o( /S 



: - initial alpha immediately followed by another 
character immediately followed by beta. 

: - initial alpha immediately followed byt7betas 
immediately followed by alpha* 

: - initial alpha immediately followed by zero or 

more successive betas immediately followed by alpha, 
e.g. , o/dl oro< orol &&&$, etc . 

: - S | . . .etc. 

: = initial alpha immediately followed by a character 
not beta 




: * initial alpha immediately followed by a character 
either beta or gamma. 




't) 6 % 



» null character 

-of /S X\/3# : = initial character may be c /5 % 
followed respectively by/2, and . 



The identifying sequences, and extracted sequences are 
shown in Figure 2, In operation, the first character of the in- 
put string and its following symbols (or the first character of 
the input string after the last extracted item) must exactly match 
the class specification sought in the program flow of Figure 1. 

When a match is obtained, a portion of the matched data is ex- 
tracted, as su^cified in the extraction list (Figure 2). Pro- 
gressing through the flow of Figure 1, the next sought-for class 
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Identity Starting with First Characters Extracted from 
Class Remaining Character Identified Portion 



1 




old* 


2 


Vnjg) 


X 7V 


3 


• A A. 


H l&) 


\n_* 


4 


(. |y) d-d * a* ( n. | e) 


old* -a* 


5 






?L 4 


5a 


(<^M 2 ) (□ 




oM* 


6 


p*^*(p 


p □ | Y ) ?\ V fn |e> 


old* (p IpuI'D^tl* ("1 


7 


P*P !*<□ 


e ) 


- T* 

* P I 


8 


c*<**(p | 


old* 



Item Classes, Identifying Sequences, and Characters Extracted. 

FIGURE 2 



t 
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specification is matched against the input character string start- 
ing with the character after the last extracted character. What 
we are doing , in brief , is identifying an item in terms of its 
possible forms and necessary pre— and post-delimiters, and then 
extracting the item only. 

Item 6 will serve as an illustration for the rest, 
which are simpler. We identify a class 6 item by a sequence 
consisting of zero or more punctuation characters; one or more 
alpha characters, either punctuation, or punctuation space, or if 
neither of these exist then the character specification is null 
and the considered input symbol must match the next specification 
symbol; one or more numeric characters; an end of record or 
character not numeric. When this specified sequence is found, 
the extracted portion of it consists of the sequence less initial 
punctuation, if any, and through the final numeric portion, in- 
cluding at the end a hyphen if such should exist as the terminating 
n character. 

The remaining 7 items are considerably simpler. As each 
item is identified and extracted, it is put in a list and identified 
with a class label. Successive items of the same class are kept 
discrete, and appropriately labelled* The item lists resulting 
are illustrated for the examples in pLgec CO - C13 of this memo. 

2.2 Division 

Items in the list are then examined for length, and 
where items exceed 6 characters they are divided into 2 items of 
the same class. Rules for division are somewhat class— dependent, 
but use subsets of the same rules in the same order . 

2,2,1 Rules 

Rule 1: Start with 7th character and work back to 1st 

character, and break on first encountered space. 
If none, repeat and look for slash, if none 
repeat and look for comma. When a break point 
is found eliminate the space, slash, or comma 
from the item, and carry the remainder of the 
item forward as a new item of the same class, 

n 

Rule 2; Check if item begins cj , if so replace 6th 
symbol with a hyphen and make new item from 
symbol 6 onward. 

Rule 3: Check if item begins^*. 7^ rv " and if so 

check if portioned A is 6 characters, 
if so, make a new item starting with n. If 
portion^ , "^1 .A? is less than 6 characters, 
check if n: - hyphen and begin new item with 
character after n if so, or with n if not so. 
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Rule 4: Check from 7th character back for p symbol not 

period, if found, break, dropping the p symbol. 
If not found repeat and look for period, if 
found, break dropping the period unless It was 
proceeded by an Oi character. 

Rule 5s Break after 6th character. 

The rules are applied in the above order when an item 
is to be split. Rule 3 is only used in splitting items of classes 
4, 6, 7 and 8. Rule 2 is only used for classes 7 and 8. However, 
no harm is done by applying these rules without exception to itpms 
of all classes - it is merely a waste of time. The exigencies of 
program construction can determine the preferred method, 

2.3 Concatenation 

After identification and division have been performed, 
one or two very short items may exist that are properly combined. 
Application of the following rules provides this: 

Rule 6: Check if successive items of class 2 and class 

3 both exist, if so, add a period after the 
class 2 item add concatenate the class 3 item 
if the total is 6 characters or less. 

Rule 7: Check for class 7 item with single character, 

concatentate into preceeding class 4 or class 
7 item, if such exists and total character 
length does not exceed 6, 

Rule 8: Check continguous class 7 - class 7 pairs, 

and concatenate with intervening space if 
total does not exceed 6 characters, 

3 . CONCLUSIONS 

The algorithm presented is a multiple pass process and 
tho oame rules could probably be incorporated into a single pass 
system, at no loss in accuracy and no necessary gain in simplicity. 
The algorithm dooe not result in a simple program. It works on 
45 difficult examples. Simpler solutions should be tested on the 
same examples. Additional difficult examples should be tested 
with this algorithm. 





tmd 



c 



o 



* o 



Examples of Algorithmic 

1 . 



QL678.A45 

Class Items Format 



1 


QL 




2 


678 


same 


4 


A45 




2. 


HF5543.2.A72 




Class 


Items 


Format 


1 


HF 


HF 


2 


5548 


5540 . 2 


3 


2 


A72 


4 


A72 




3. 


HF5548.276A72 




Class 


Items 


Format 


1 


HF 


HF 


2 


5548 


5540 


3 


276 


.276 


4 


A72 


A72 


4. 


Z7164.E2A57 




Class 


Items 


Format 


1 


Z 




2 


7164 




4 


E2 


same 


4 


A57 




5. 


E467 


. 1.A2A15 




Class 


Items 


Format 


1 


E 


E 


2 


467 


467.1 


3 


1 


A2 


4 


A2 


A 15 


4 


A15 





Call Number Division 

6 . 



HF1456 1964. C6 
Class Items Format 



1 


HF 




2 


1456 


same 


5 


1964 




6 


C6 




7. 






BS191 


.A1.1952.N4 




Class 


Items 


Format 


1 


BS 


BS 


2 


191 


191 


4 


A1 


A1 


7 


1952. N4 


1952 






N4 


O 






o • 






JX636 


1892 1955, 


no. 21 


Class 


Items 


Format 


1 


JX 




2 


636 




5 


1892 


same 


5 


1955 




6 


no. 21 




9. 






GA408 


1751. F7M3 




Class 


Items 


Format 


1 


GA 




2 


408 




5 


1751 


same 


6 


F7 




6 


M3 




10. 






RD31. 


A425 1965 




Class 


Items 


Format 


1 


RD 




2 


31 




4 


A425 


same 


5 


1965 






c 






11 . 



QK647.A35 1964c 
Class Items Format 



1 


QK 




2 


647 


same 


4 


A3 5 




5 


1934c 





12. 






JX1975.A49 1923hd 


Class 


Items 


Format 


1 


JX 




2 


1975 


same 


4 


A49 




5 


1923bd 




13. 






J87. 


M4 1958g no 


.2980 


Class 


Items 


Format 


1 


J 


J 


2 


87 


87 


4 


M4 


M4 


5 


lC53g 


1958g 


6 


no. 2988 


no. 






2988 


14. 






QK1. 


U5 vol. 35, pt. 6 


Class 


Items 


Format 


1 


QK 




2 


1 




4 


U5 


same 


6 


vol. 35 




6 


pt.6 




15. 






Q11.P6 N.S. ,vol 


,55pt.3 


Class 


Items 


Format 


1 


Q 


Q 


2 


11 


11 


4 


P6 


PS 


7 


N.S.,vol. 


55pt.3 N. 



vol . 55 
pt.3 



IS. 



D1.T68 t.l4,zesz.4 
Class Items Format 



1 


D 




2 


1 




4 


TS8 


same 


S 


t .14 




S 


zesz.4 




17. 


B5 [no. 15] 




DA25, 




Class 


Items 


Format 


1 


DA 




2 


25 


same 


4 


B5 




6 


no. 15 




18. 






E51.H337 vol.57/no.l. 


Class 


Items 


Format 


1 


E 




2 


51 




4 


R377 


same 


S 


vol. 57 




s 


no. 1 




19. 






Z733 


.C531J 




Class 


Items 


Format 


1 


Z 


Z 


2 


733 


733 


4 


C531 


CS31J 


7 


J 




20. 






ZS96 


.U5H 1959 




Class 


Items 


Format 


1 


Z 


Z 


2 


696 


396 


4 


U5 


U5H 


7 


H 


1959 


7 


1959 
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21 . 



HJ9013.C6b 




Class 


Items 


Format 


1 


HJ 


HJ 


2 


9013 


9013 


4 


C6 


C6b 


7 


b 




22. 






PB2750.Bo7 1958 




Class 


Items 


Format 


1 


PR 




2 


2750 




4 


B 0 7 


same 


5 


1958 




23. 






PZ3. 


C3933Br 




Class 


Items 


Format 


1 


PZ 




2 


3 




4 


C3983 




7 


Br 




24. 






PZ3. 


C4637Cas2 




Class 


Items 


Format 


1 


PZ 




2 


3 




4 


C4637 


same 


6 


Cas2 




25. 






PZ3, 


, D89Th 80 




Class 


I % ems 


Format 


1 


PZ 


PZ 


2 


3 


3 


4 


D09 


DC9 


7 


Th 


Til 30 


7 


30 





26. 



PZ3 . C86153Ghf c 2 
Class Items Format 



1 


PZ 


PZ 


2 


3 


3 


4 


C86153 


C86153 


7 


Ghfc 


Ghfc 2 


7 


2 




27. 






PZ7. 


B4656Wav 




Class 


Items 


Format 


1 


PZ 




2 


7 




4 


B4656 


same 


7 


Wav 




28. 






PZ3 , 


,S38An 




Class 


Items 


Format 


1 


PZ 




2 


8 




4 


S38 


same 


7 


An 




29. 






PZ0, 


: l.H75IWat 




Class 


Items 


Format 


1 


PZ 


PZ 


2 


8 


8.1 


3 


1 


H75 


4 


H75 


IWat 


7 


IWat 




30. 






PZ10 . 3 . S837No 




Class 


Items 


Format 


1 


PZ 


PZ 


2 


10 


10.3 


3 


3 


S337 


4 


S337 


No 


7 


No 
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31. 



PZ10 • 3 . D632Lad4 
Class Items Format 



1 


PZ 


PZ 


2 


10 


10.3 


3 


3 


D632 


4 


D632 


Lad4 


6 


Lad4 




32. 


JX1977.A2 S/3111. 


etc. 


Class 


Items 


Format 


1 


JX 




2 


1977 


same 


4 


A2 




6 


E/3111 




7 


etc. 




33. 


JX1977.A2 E/CN.6/341/rev.l 


Class 


Items 


Format 


1 


JX 


JX 


2 


1977 


1977 


4 


A2 


A2 



7 E/CN. 6/341/rev. 1 E/CN.6 

341 

rev.l 

34. 



36. 

Microcard PS3545.Y45 



Class 


Items 


Format 


0 


Microcard 


Micro- 


1 


PS 


card 


2 


3545 


Ps 


4 


Y45 


3545 

Y45 


37. 


Microfilm 01104 


no. 93-94 AP 


Class 


Items 


Format 


0 


Microfilm 


Micro- 


7 


01104 


card 


7 


no. 93-94 


01104 


7 


AP 


no. 93- 

94 

AP 


30. 


Microfilm 01291 


reel 63, no .IE 


Class 


Items 


Format 


0 


Microfilm 


Micro- 


7 


01291 


film 


7 


reel 


01291 


7 


63 


reel 


7 


no. IE 


63 

no. IE 



ML29c.M5 (music) 39. 

Class Items Format Incun. 1494. L3 Rosenwald Collec- 



tion 



1 


ML 




Class 


Items 


Format 


2 


29 


same 








7 


c.M5 




0 


Incun. 


Incun. 








7 


1494. L3 


1494 


35. 






7 


Rosenwald 


L3 


G3701S.C5 var.U5 


(map) 


7 


Collection 


Rosen- 


Class 


Items 


Format 






wald 












Colle- 


1 


G 








ction 


2 


3701 


same 








7 


s.C5 










7 


var.U5 
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L.C.l 



JX1977.A2 St/SOA/40 
Class Items Format 



1 


JX 


JX 


2 


1977 


1977 


4 


A2 


A2 


7 


St/SOA/40 


St/SOA 


L.C.2 




40 


UG633.A3763 no. 62 


-5 1960 


Class 


Items 


Format 


1 


UG 




2 


633 




4 


A3763 




6 


no. 62- 


same 


7 


5 




7 


1930 




L.C.3 


J85 


1951. C9j 82d 


,1st no. 17 


Class 


Items 


Format 


1 


J 


J 


2 


85 


85 


5 


1951 


1951 


6 


C9 


C9j 


7 


j 


82d 


7 


32d,lst 


let 


7 


no. 17 


no. 17 


L.C.4 


DT333 


•S34 47.Jahrg 


. ,Heft Nr. 8 


Class 


Items 


Format 



1 


DT 


DT 


2 


333 


333 


4 


S34 


S34 


7 


47. Jahrg. ,Heft 


47 


7 


Nr. 8 


Jahrg 
Heft 
Nr. 8 



L.C.5 



Z2407.B no. 32, 


etc. 


Class 


Items 


Format 


1 


Z 




2 


2407 


same 


7 


B 




7 


no. 32, 




7 


etc. 




L.C.6 


BX2234. 


M56 1565 Rosenwald Coll 


Class 


Items 


Format 


1 


BX 


BX 


2 


2264 


2264 


4 


M56 


M56 


5 


1565 


1535 


7 


Rosenwald 


Rosen- 


7 


Coll. 


wald 

Coll. 
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Variations with respect to model examples in Ann Curran* s TM-233 



(8) JX 
636 
1892 
1955 
no .21 



example shows "1955, though final commas 
are missing in both example and algorithmic 
versions in the case of nos. 14 and 16 — 
difference assumed trivial. 



(17) DA example includes dropped indentures "[no. 15]". 

25 — difference assumed trivial. 

B5 

no. 15 



(18) E example has "vol.57/" slash conveys no informa- 

51 tion —difference assumed trivial. 

H337 
vol . 57 
no.l 



(31) PZ example has 'I* ad 4 ", though input sequence 

10.3 has no space — difference assumed trivial. 

D632 
Lad4 




APPENDIX D 




Analysis of the BIARC II Format 

The BIARC II report defines format as "the structure , 
content . and coding of a record. The structure will provide the 
framework for incorporating both fixed and variable length fields 
within the record. The content is the data recorded in these 
fields. The coding is the machine representation of the character 
set."* 



Another significant characteristic of machine formats, 
in addition to the structure, content, and coding, is the identifi- 
cation of the data in the record. In this appendix the structure, 
content, and identification of data in the BIARC I and BIARC II 
formats will be compared. The implications that the changes in 
the BIARC II format have on the NELINET processing of Library of 
Congress data and the generation of its own cataloging records will 
also be noted. 

Structure 



The BIARC I format contains a fairly long fixed length 
field followed by a number of variable length fields. Each variable 
field contains a logical segment of the catalog card data preceded 
by six characters which contain the length of the field and the 
identifying tag for the field. The collation statement for a book 
would be found in the record as: 



v 



10 



6 



4 



0 



6 



J L 



V 

4 

tag for the 
field, 40 * 
collation 



15 



20 



u 



. I 



» i * 



7 



25 



m 



V 

data 



Length of 
this field 



1 Avram, Henriette D. , Knapp, John F. and Rather, Lucia J., 

The BIARC II Format. A Communications Format for Bibliographic 
Data. Preliminary Edition. Washington, Library of Congress, 
December, 1967. 
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The MARC II format consists of three parts - the leader, 
the directory, and the data fields. (See Figure D-l.) The leader 
is a short fixed length field which contains certain information 
about the machine record. The directory is made up of fixed length 
fields containing the identifying tag, the length of the correspond- 
ing data field, and its starting position relative to the beginning 
of the record. The data fields may be of fixed or variable length. 
The first two characters of each data field are alio ted to indicators 
which further identify or describe the data field. In a MARC II 
record the collation data in the above example would appear in the 
directory as: 



3 1 0 

V 



v 



0 10 I 0 I 2 L lo lo |5 I 4 1 7| 

1 ' V 1 No. 6f characters from start of 

record to start of collation data, 
547 is hypothetical. 






Length of 
data field 



Tag 300 « 
collation 



and in the data field as: 

| | |1 jo |6 I |p | . |$t 11 1 111 ul S I . I $ 12 |7 | | a I . I 

1 — V 1 

cators Data 

(^delimiter, a non printing separator of subfields) 

(eof « end of field mark) 

The NELINET format consists of two parts - the directory 
(map) and the data. Each entiy in the directory is of fixed length 
and contains an identifying tag for a field (item) in the data and 
the position of the first character of the field relative to the 
beginning of the data. Data fields can be of fixed or variable 
length . 



The structure of the MARC II format is more similar to 
NELINET than was MARC I. As far as structure is concerned, MARC II 
and NELINET are highly compatible, both being "mapped" records. 
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MARC II - Tape Structure 



Field 1 


Field 2 


Field 3 

Variable 

Control 


Tien n 

Variable 

Fixed 


Field b 

Variable 

Data 


Leader 


Directory 


Number 


Fields 


Field 1 



Field & 
Variable 
Data 
Field 2 



Leader 



i 

] 

] 



° 






i 

i 




Directory 




Variable Data Field 





FIGURE D-l 



AAAAAA 



Content 



Very little MARC I data has been eliminated from the MARC II 
format. The codes for publisher and city of publication are not 
included but have been replaced by the publisher's prefix in the 
Standard Book Number and the country of publication code respectively. 
The code to indicate that a work is a grammar has been omitted. 

A considerable amount of new data has been added to the 
MARC II record. Some of this new information is contained as codes 
and indicators in the variable fixed field. Tag 001, and some is 
contained as new variable data fields, e.g., the National Library of 
Medicine Call Number. An itemized listing of the MARC II fixed and 
variable fields along with an indication of the new fields not in 
MARC I are presented in Table D-l. 

Identification of Data 

In the MARC II format many of the data fields are more 
precisely identified and described than they were in MARC I. Most 
of these additional distinctions are made to facilitate machine 
filing according to the Library of Congress filing rules. The 
distinctions among personal, corporate, and conference name entries 
as well as those indicating filing treatment for personal and 
corporate name added entries greatly increase the number of different 
tags in the MARC II format. The fixed and variable fields which 
are identified differently in MARC II than they were in MARC I are 
shown in Table D-l. 

Table D-l lists the data elements identified by position 
in the fixed field and by a tag given to a variable field. In the 
MARC format data elements within a variable £ield are also identified. 
Usually a special character called a delimiter separates the sub-ields 
contained within a field. A personal name entry, for example, may 
contain four subfields - name, title, identifier, and relator. Un- 
like MARC I, MARC II indicates vacant subfields. The changes in 
internal identification within tagged fields are shown in Table D-2. 

Implica tions for the NELINET Processi ng Center Programming 

The changes in the MARC II format will mean almost total 
reprogramming of the card production programs. In addition to the 
new data in MARC II and the differences in identification of data 
there are other changes in the MARC II record which will require 
extensive changes in the programs. Title added entries are specified 
by an indicator in the title statement field rather than b., a separate 
tag as they were in MARC I. Series entries in which the author of 
the series is the main entry are to be generated by combining the 
main entry with the last part of the series statement rather than 
from data in one field. Such techniques are better suited to a 
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machine based system than were the practices of MARC I but will 
require considerable programming changes to accommodate them* 

The decision has, therefore, been made to delay such 
reprogramming until M/.RC II has been firmly established and has 
been running for some time* This delay will also provide the op- 
portunity of gaining more operating experience with the present 
programs to see how they might be improved* In order to P ro ^^® 
continuous service after the Library of Congress changes to MARC II, 
a conversion program will be written which converts the Library of 
Congress MARC II data tapes to the NELINET MARC I format* These 
tapes can then be run through the existing card and label production 
pro rams and NELINET card production demonstration services need not 
be interrupted* The conversion from MARC II to NELINET master file 
format is a much simpler program to write than a new card production 
program and could be written before the Library of Congress starts 
to issue MARC II tapes in July, 1968* 

Data Creation 

The tagging of data for a MARC II record will be much 
more difficult than MARC I tagging. One would have less confidence 
in anyone but a trained cataloger doing or revising it. The 
distinctions made among personal and corporate name added entries 
to indicate filing treatment for instance, will require knowledge 
of the filing rules. Identifying personal names as forename or 
surname entries will at times be difficult especially when dealing 
with foreign names. Distinguishing geographic names from political 
jurisdictions will sometimes be difficult as will determining soae 
period subdivisions* More time will be required to tag the record; 
more time will be required to train people to tag properly; more 
errors can be expected in the tagging; and more correction time will 
be required before "clean" records are obtained. 

As originally planned, the six participating libraries 
would send original cataloging or Library of Congress non-MARC 
cataloging to the center and the center would tag and key it. *ome 
of the tags in the MARC II format would be difficult to assign apart 
from the cataloging process. Such tags are: 

(1) The main entry is the publisher. (The publisher 
is omitted on the catalog card both when it is 
the main entry and when it is unknown.) 

(2) The language of the original and the language 
from which the work was translated. (This 
also occurred with MARC I*) 

(3) The book contains an index. 
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Since MARC II will become the standard format for the 
exchange of machine readable cataloging data among libraries, it 
is certainly desirable that others generating machine records use 
it. To do so will require considerable time and sophistication 
on the part of those who identify the items in the data* If the 
NELINET center is to generate complete MARC II records, it can only 
do so with help from the participating libraries. 
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TABLE D-l 

MARC II - Identification of Fields 



<0 

P 

A 

•P 

ro 


Tot 


Tnd* 1 


Data Element — 








Leader 








Record Length 








Record Status, New 








Record Status, Changed 


♦* 






Record Status, Deleted 








Record Status, Old 


** 






Legend, Count 


** 






Legend, Type, Language Materials, Printed (A) 


** 






Legend, Type, Language Materials, Manuscript (B) 


** 






Legend, Type, Music, Printed (C) 


** 






1 Legend, Type, Music, Manuscript (D) 


** 






Legend, Type, Maps, Printed (E) 


** 






1 Legend, Type, Maps, Manuscript (F) 


♦* 






I Legend, Type, Motion Pictures and Filmstrips (G) 


** 






Legend, Type, Microforms (H) 


** 






I Legend, Type, Sound— Recordings , Language (I) 


** 






I Legend, Type, Sound-Recordings, Music (J) 


♦♦ 






I Legend, Type, Pictures (K) 


** 






I Legend, Type, Computer Mediums, (L) 


♦* 






1 Legend, Type, Authority Data-Names (X) 



* Data 
** New 
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TABLE D-l 

MARC II - Identification of Fields 





Bata Element 

Leader (continued) 

Legend 5 Type, Authority Data-Subjects (Y) 
Legend, Bibliographic Level, Analytical 
Legend, Bibliographic Level, Monographic 
Legend, Bibliographic Level, Serial 
Legend, Bibliographic Level, Collective 
Directory 
Tag 

Field Length 

Starting Character Position 
Variable Cortrol No, 

LC Card No. (Prefix, Year, Serial No.) 

LC Card No. Check Digit 
LC Card No. Supplement No. 

LC Card No. Suffix 
Variable Fixed Field 

No. of Entries in Directory 

Date Entered on File (Day, Mo., Yr.) 

Type of Publication Date, Single (S) 

Type of Publication Date, Copyright (C) 
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TABLE D-l 

MARC II - Identification of Fields 




Tag 



Ind 



Data Element 

Variable Fixed Field (continued) 

Type of Publication Date, Not Known (N) 

Type of Publication Date, Reprint (R) 

Type of Publication Date, Multiple (M) 

Type of Publication Date, Questionable (Q) 
Date 1 
Date 2 

Country of Publication Code 
Illustration Code - Illustration (A) 
Illustration Code - Maps (B) 

Illustration Code - Portraits (C) 

Illustration Code - Charts (D) 

Illustration Code - Plans (E) 

Illustration Code - Plates (F) 

Illustration Code - Music (G) 

Illustration Code - Coats of Arms (I) 
Illustration Code - Genealogical Tables (J) 
Illustration Code - Forms (K) 

Intellectual Level Code (Juvenile) 

Form of Microreproduction Code, Microfilm (A) 
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TABLE D-l 

MARC II - Identification of Fields 



Status 


Tag 


Ind* 


Data Element 








Variable Fixed Field (continued) 


** 






Form of Microreproduction Code, Microfiche (B) 


** 






Form of Microreproduction Code, Microopaque (C) 








Form of Content Code, Bibliographies (A) 








Form of Content Code, Catalogs (B) 








Form of Content Code, Indexes (C) 


** 






Form of Content Code, Abstracts (D) 








Form of Content Code, Dictionaries (E) 


** 






Form of Content Code, Encyclopedias (F) 


** 






Form of Content Code, Directories (G) 


** 






Form of Content Code, Yearbooks (H) 


** 






Form of Content Code, Statistics (I) 


** 






Form of Content Code, Handbooks (J) 


** 






Government Publication Indicator 








Conference Publication Indicator 


** 






Festschrift Indicator 


** 






Ihdex Indicator 


* 






Main Entry in Body of Entry Indicator 


** 






Fiction Indicator 



ERIC 
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TABLE D-l 

MARC II - Identification of Fields 




Local System No. 

BNB Classification No, 

Dewey Decimal Classification No.(s) 
LC Call No. 

LC Call No. - Book not in LC 
Copy Statement 
NLM Call No. 

NAL Call No. 

NAL Subject Category No. 

Universal Decimal Classification No. 
Local Call No. 



o 

ERIC 
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TABLE D-l 

MARC II - Identification of Fields 



-p 1 

a I 
v 1 
CO I 


Tag I Ind 


* 


100 0 


♦ 1 


100 1 


♦ 1 


100 2 


* 1 


100 13 


♦* 


100 4 


** 1 


100 5 


** 1 


100 6 


** 1 


100 1? 


* 1 


110 0 


♦ 1 


110 1 


* 1 


110 2 


** 


UO I 4 


*♦ 


110 5 


** 


110 6 


* 


111 |o 


♦ 


111 1 


* 


111 2 


♦♦ 


1 111 1 4 


** 


111 5 



Data Elements 
Variables Fields ^continue* 

Main Entry 

Personal , Forename 

Personal, Single Surname 

Personal, Multiple Surname 

Personal, Name of Family 

Personal, Forename, is Subject 

Personal Single Surname, is Subject 

Personal, Multiple Surname, is Subject 

Personal, Name of Family, is Subject 

Corporate , Surname 

Corporate, Place 

Corporate, Name (Direct Order) 

Corporate, Surname, is Subject 

Corporate, Place, is Subject 

Corporate, Name (Direct Order), is Subject 

Conference, Surname 

Conference, Place 

Conference, Name (Direct Order) 

Conference, Surname, is Subject 

Conference, Place, is Subject 
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TABLE D-l 

BIARC II - Identification of Fields 




** 111 6 
♦ 120 0 
** 120 1 
130 0 
** 130 1 
200 0 
** 200 1 

** 210 0 
** 1210 |l 
** |220 
240 10 
♦ 240 II 
|250 
160 




Data Element 

Variable Fields (.continued) 



Blain Entry (continued) 

Conference 9 Name (Direct Order) 9 is Subject 
Corporate with Form Subheading 
Corporate with Form Subheading! is Subject 
Uniform Title 
Uniform Title, is Subject 
Title ("Uniform," "Conventional," or "Filing") 

Title ("Uniform," "Conventional," or "Filing!) is on LC 
Printed Cards 

Romanized Title 

Romanized Title, Receives Title Added Entry 
Translated Title 
Title Statement 

Title Statement Receives Added Entry 

Edition Statement 

Place(s) 

Publisher 

Blain Entry is Publisher 
Date(s) 

Collation 

Bibliographic Price 
Converted Price 
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TABLE D-l 

MARC II - Identification of Fields 



w 

3 

•P 

(8 

•P 

W 



Tag 




* 400 
*14001 l 

* 400 2 

* 400 3 
** 1 400 1 4 
** 400 5 
** 40016 
♦*40017 

* 408 

* 410 0 

* 410 1 

* 410 2 
** 410 4 
** 410 5 
** 1410 1 6 

* 411 0 

* 411 1 

* 411 2 
** 411 4 



Data Elements 

Variable Fields (continuedT 



Series, Traced, 

Personal, Forename 
Personal , Single Surname 
Personal, Multiple Surname 
Personal, Name of Family 
Personal, Forename, is Main Entry 
Personal, Single Surname, is Main Entry 
Personal, Multiple Surname, is Main Entry 
Personal, Name of Family, is Main Entry 
Personal, Title 
Corporate, Surname 
Corporate, Place 
Corporate, Name (Direct Order) 

Corporate, Surname, is Main Entry 
Corporate, Place, is Main Entry 
Corporate, Name (Direct Order), is Main Entry 

Conf erence , Surname 
Conference, Place 
Conference, Name (Direct Order) 

Conference, Surname, is Main Entry 
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TABLE D-l 

MARC II - Identification of Fields 



Series, Traced, 

Conference, Place, is Main Entry 
Conference, Name (Direct Order), is Main Entry 

Corporate, Title 




Series, Untraced 

Series, Untraced, Series is Traced Differently 
Bibliography Note 
Dissertation Note 
Contents Note (Formatted) 

"Bound T. T ith" Note 
"Limited Use" Note 
General Notes (Ail Others) 

Annotation or Abstract 



600 

600 

600 

600 



0 

1 

2 

3 



Subject Entry 

Personal, Forename 
Personal , Single Surname 
Personal , Multiple Surname 
Personal, Name of Family 




Status 
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TABLE D-l 

MARC II - Identification of Fields 




*608 
♦ 610 
*610 
*610 
*611 
*611 
* 611 
* 618 
* 620 
* 628 

* 630 

* 650 

* 651 

* 652 

* 653 

* 655 

* 1 656 

* 657 
** 1 660 
**661 



0 

1 

2 

0 




Data Element 

Variable Fields (continued) 

Subject Entry 

Personal 9 Title 

Corporate, Surname 

Corporate, Place 

Corporate, Name (Direct Order) 

Conference, Surname 

Conference, Place 

Conference, Name (Direct Order) 

Corporate, Title 

Corporate, N ame with Form Subheading 
Corporate, Name with Form Subheading, Title 
Uniform Title 
Topical 

Geographic Name 

Political Jurisdiction 

Proper Names, Not Capable of Authorship 

General Subdivision 

Period Subdivision 



Place Subdivision 



NLM (MeSH) Heading 

NLM (MeSH) Heading, Topical Subheadings 




TABLE D-l 



MARC II - Identification of Fields 



09 

3 

+» 

(0 



CO 



Ind 



Data Element 

Variable Fields (continued) 
Subject Entry 



** 

** 

** 

** 

** 



662 

663 

664 
670 
690 



* 

* 

♦ 

* 

* 

♦ 

♦ 

♦ 

♦ 

* 

* 

♦ 



700 @ 
A 

700 B 
700 |C 
700 |D 
700 E 

700 If 
700 
700 
700 |I 
700 
700 
708 



NLM (MeSH) Heading, Geographical Subheadings 
NLM (MeSH) Heading, Period Subheadings 
NLM (MeSH) Heading, Form Subheadings 
NAL Agricultural/Biological Vocabulary 
Local Subject Heading 
Added Entry 

Personal, Forename, Alternative 
Personal, Single Surname, Alternative 
Personal, Multiple Surname, Alternative 
Personal, Name of Family, Alternative 
Personal , Forename , Secondary 
Personal, Single Surname, Secondary 
Personal, Multiple Surname, Secondary 
Personal, Name of Family, Secondary 
Personal, Forename, Analytical 
Personal, Single Surname, Analytical 
Personal, Multiple Surname, Analytical 
Personal, Name of Family, Analytical 
Personal, Title 
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TABLE D-l 

MARC II - Identification of Fields 



Q 



Status 


Tag 


Ind, 


Data Elements 


* 


710 


0 


Variable Fields (continued) 

Added Entry 

Corporate 9 Surname, Alternative 


♦ 


710 


A 


Corporate, Place, Alternative 


* 


710 


B 


Corporate, Name (Direct Order), Alternative 


* 


710 


D 


Corporate, Surname, Secondary 


* 


710 


E 


Corporate, Place, Secondary 


* 


710 


F 


Corporate, Name (Direct Order), Secondary 


y. 


710 


H 


Corporate, Surname, Analytical 


* 


710 


I 


Corporate, Placet Analytical 


* 


710 


J 


Corporate, Name, (Direct Order), Analytical 


♦ 


711 


0 


Conference, Surname, Alternative 


* 


711 


A 


Conference, Place, Alternative 


* 


711 


3 


Conference, Name, (Direct Order), Alternative 


* 


711 


) 


Conference, Surname, Secondary 


* 


711 


5 


Conference, Place, Secondary 


* 


711 


? 


Conference, Name, (Direct Order), Secondary 


* 


711 


1 


Conference, Surname, Analytical 


* 


711 


l 


Conference, Place, Analytical 


* 


711 


1 


Conference, Name, (Direct Order), Analytical 




713 




Corporate, Title 



ERIC 



Status 
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TABLE D-l 

BIARC II - Identification of Fields 





Data Element 



* 

♦ 

* 

* 

♦ 

* 

* 



720 

720 

720 

723 

730 

730 

730 

740 

753 



riable Fields (continued) 
Added Entry 



Corporate f with Form Subheading! Alternative 

Corporate 9 with Form Subheading! Secondary 

Corporate! with Form Subheading! Analytical 

Corporate ! with Form Subheading, Title 

Uniform Title, Alternative 

Uniform Title, Secondary 

Uniform Title, Analytical 

Title Traced Differently 

Proper Names Not Capable of Authorship 

Series Added Entry 



♦ 

♦ 

♦ 

* 

♦ 

♦ 

♦ 



300 

800 

800 

800 

808 

810 

810 



1 

2 

3 



Personal, Forename 
Personal, Single Surname 
Personal, Multiple Surname 
Personal, Name of Family 
Personal, Title 
Corporate, Surname 
Corporate, Place 



D - 2C 



TABLE D-l 

MARC II - Identification of Fields 
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TABLE D-2 



MARC II - Internal Identification githin Tagged Fields 




** 



** 

»♦ 

** 

** 

* 

♦ 

* 

* 

* 

** 

** 

** 

** 

** 



003 
003 
003 
003 
012 
012 
012 
030 
050 

050 

051 
051 
051 
060 
060 
090 
090 
090 
100 ,400 | 600 

700,800 



Data Element 



Language(s) of Work 

Language Translated From 

Language of Original 

Language (s) of Summaries 

Standard Book Number — Publisher’s Prefix 

Standard Book Number— Title No. 

Standard Book Number— Check Digit 

More than 1 Dewey Decimal Classification No. 

LC Call No.— Class No. 

LC Call No.— Book No. 

Copy Statement — Class No. 

Copy S cat ement— Book No. 

Copy Statement— Remainder 
NLM Call No.— Class No, 

NLM Call No.— Book No. 

Local Call No. 

J Local Holding Collection Code 
Local Number of Copies 



I 






r 






Personal Name Entries— Name 
Personal Name Entries— Title 
Personal Name Entries— Identifier 
Personal Name Entries— Relator 



* Data Identified Differently 
** New Data 



3 

ERIC 



(Status 
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TABLE D-2 

MARC II - Internal Identification Within Tagged Fields 




♦ 

* 

* 

♦ 

* 



110,410,610| 
710,310) 

111,411,611 
711,811 



Data Element 



r 

.ilame 



Place 



Corcorrte Entriei 



< 



l £ 



Subdivision or 



< 



Name 

Subdivision 



< 



Conference Entries— Name 
Conference Entries— No. 



* 

* 



* 



♦♦ 

♦ 

* 



120,620,720 

J 

240 
240 
240 
250 

250 
300 

300 
300 
300 
408,418,4481 
808,818,34^ 



Conference Entries— Place 
Conference Entries— Date 

Corporate Name With Form Subheadings— Name 
Corporate Name With Form Subheadings— Title 
Title Statement— Short Title 
Title Statement— Remainder of Title 
Title Statement — Remainder to Edition 
Edition Statement — Edition Information 
Edition Statement— Remainder 
Collation— Paginat ion 
Collation — Illustrations 
Col la t ion— Heigh t 
Collation — Thickness 
ISeries— Title 
Series— Number 
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PRODUCTION PROBLEMS AND DECISIONS 



Soon after New Hampshire began receiving cards and labels 
they requested an opportunity to meet with Inforonics about problems 
they had encountered. On February 8, 1968, Inforonics personnel 
went to the NE3HE office in Durham to discuss the output products 
with the UNH catalogers. Listed are the problems, questions, etc. 
considered and the decisions on them reached during the reeting or 
subsequent to it. In some cases, a decision was postpo 3d until 
the other participants have received and considered son * cataloging 
products. 



1. Typeface of labels: Other typefaces are being investigated. 

2. Spacing: 

a. Before edition statement: No NEBHE standard, NELINET oaids 

follow format of UConn, UMe, URI, and UMass, 2 spaces. 

b. Before imprint: No NEBHE standard, NELINET cards follow 

format of UConn, UMe, UMass, and UVt, 2 spaces. 

c. Within collation: Items are not separately e 

MARC tapes so no change is now feasible. NELINET libraries 

vary. 

d. Before series statement: No NEBHE standard, three libraries 

use 3 spaces and three use 2 spaces. NELINET cards w 

use 2, as, other things being equal, the saving of space 

seems desirable. 

e. Between tracing number and tracing data: No J^none**' 

UConn, UMe, UMass and UVt leave one space, UNH leaves none. 
NELINET cards will have on© space for the present. inis 
space could be omitted if the libraries thought the space 

saved warranted it . 



3. Blank lines: 

a. Before the first note: All libraries follow this practice. 

NELINET cards do not in order to conserve space. This could 

be changed. 

b. Before the tracings: Five libraries follow this practice 

(URI format unknown) . NELINET cards do not in order to 
conserve space. This could be changed. 

c. Between the header data on card 2 and continuation of text: 
This blank line is now produced on the NELINET cards. 

4 The BNB number which can be used to order BNB cards will no 

longer be printed on NELINET CARDS; in its place an abbreviation 
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5 . 



6 . 



7. 



8 . 



9 . 



10 . 



11 . 



12 . 



13. 



14. 



15. 



16. 



of the requesting library will be printed to aid card 
distribution. 

Oversize books: Libraries may encounter oblong books whose 
width requires they be shelved as an oversize volume though 
their spine length does not. Present programs will probably 
not handle them correctly; libraries should submit the local 
call number with their oversize symbol. 



Errors in data such as Hobart for Robert are due to mistakes 
in the L.C. tapes. 

Partial title entries: Programs are being corrected to produce 

’’catchword” titles. 



Series tracings: Programs that produce overprinted series 

headings are being corrected. When local series entry prac oes 
differ from L.C. treatment, the library will have to adjust 

the cards or its practice. 

Erratic indention of headings on card two’s has been fixed. 

Branch and special shelf location will a PP® ar . ov !^ a iL Ca11 
numbers for the present. This was agreed to by tne s 
libraries at the meeting on May 24, 1967. 

Author/title or title information repeated on card two’s will 
be in L.C. format. 



To reduce the frequency of card two’s, line 14 will be 
available to :'ata and the continuation statement (Cont. next 
card) will be on the 15th line. 

Tracings beginning on top line of the card: This is done to 

allow the maximum space for the tracings and is L.C. MARC 
practice. One could even assert it assists the catalog user 
to distinguish the heading data from the main entry. 

Punctuation before a subject subdivision: MARC uses 

dash as do UConn and UVt. (UMass and UNH use space 
and UMe uses a single space). To do other than follow MAR 
would require considerable effort, require more sp»‘‘« 
form to the format of the same number of libraries. 

Subject subdivision abbreviations: These are now spelled out. 

It was suggested that in tracings ' Title. and lib^ar ies 
shortened to "T." and "T:" to save space. l®blNET lib i 
nresently do so. The change, while fairly simple, seems nest 
postponed until all the libraries have receive*, some cards. 
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17. New Hampshire felt that inserting the cards and labels into 
envelopes was unnecessary. It will be suspended. If other 
libraries request it, resumption could be reconsidered. 

IS. Margins: 

a. Labels: a one space left margin will be used on labels. 

b. Cards: 

1. A one space margin to the left of the call number will 
be used. 

2. Programs presently produce cards with L.C. format, that 
is, the main entry begins to the left of all other lines. 
This means considerable wasted s^pace as all subsequent 
lines begin at the second or third indention. Changes 
to bring cards closer to the formats of the NELINET - 
braries are being written, "flanging indention will be 
used for the title paragraph of anonymous works as on 

p. 65 of L.C. Colvin's Cataloging sampler. l cf * Cata- 
loging Service bulletin no. 69 and AA rules footnote 
p. 192). Line endings will not be indicated for titles 
in verse, etc. as the MARC record does not provide for 
them (cf. AA rule 133). 

19. L.C. call number: Present programs to divide the L.C. call 

number for labels and for the left margin (gutter) 
of the cards are imperfect. 

a. When the numeric part of the cla f s ^ f icat ^ on . n ^? b ®^ 4 ^ ^he 
than six characters long but includes a decimal 
programs divide on the decimal which is contrary to 5?®® 
library practice and to the format specified in the Pre- 
sentation notes for the May 24, 1967 NEBHE meeting. 

b. When a period is followed by a space, e.g., vol. 2, two 
carriage returns are executed resulting in a blank line 

between "vol.” and ”2”. 

c The decimal point is not being dropped before the book 

number as was also specified in the May 24, 1967 presents on 

notes . 

d. An algorithm to solve the problems of L.C. call number divi- 
sion has been developed by W. Nugent. Programming to in- 
corporate it into the present programs has been postponed 
to allow manual testing of it. The Present effect of the 
above problems should be small because "a" and b occur 
infrequently and "c" is not a problem until Rhode Island 
or Massachusetts begins requesting cards because New Hamp- 
shire presently retains the decimal point before the boo* 

number . 



o 
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20. Title tracing: Present programs remove punctuation other than 

periods , question marks, and exclamation points from titles 
before using them as a heading. New Hampshire expressed the 
desire that a period appear at the end of all such headings. 
This programming is possible but has been postponed until 
other libraries have a chance to react to the cards. 

21. Future problems: To enable Inforonics to assess and improve 
the quality of the products, New Hampshire will fill out 
problem reports for each new problem and annotate and return 
cards which are not usable. Both will be forwarded to R. 
Simmons. As soon as 7, 3, and 12 are solved, New Hampshire 
will resume requests. 
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OPERATING PROCEDURES FOR 
INFORMATION TRANSMITTAL IN 
NEW ENGLAND LIBRARY INFORMATION NETWORK 



Preoared by 
Robert H Simmons 



October 18, 1967 



OPERATING PROCEDURES FOR 
INFORMATION TRANSMITTAL IN 
THE NEW ENGLAND LIBRARY INFORMATION NETWORK 



Installed at each of the participating libraries is a 
teletypewriter, model ASR-33. These teletypewriters will be used 
to transmit requests between the participating libraries and the 
NELINET central processing center. The requests will be for 
catalog cards, book spine labels, and book pocket labels, for the 
participating libraries and their branches. 

The requesting procedures hr.ve been made rd concise r.s 
possible to facilitate ease of use by the cataloger, and ease of 
transmittal by the transmitting clerk. Suggestions for easier 
methods are actively solicited. 

NELINET Request Form 



The NELINET Request Form is similar for each of the 
participating libraries. The area of deviation is the library 
code and the shelf list and branch name abbreviations. The 
request form will eventually have more information included in 
it as the demonstration period progresses into title and/or 
author searching. 

A short explanation of each lir.o follows. A more 
comprehensive explanation of each line is included in the section 
entitled "Cataloger* s Procedures for Filling Out Request Sheets". 
We will use Figure 1, ^r^e 12 as an example work sheet. 

RI This will be preprinted on all Rhode Island library 

request sheets. This is a library identifier and will 
be transmitted with every request. 

LC This line is for Library of Congress call numbers. All 

requests for information will have this line filled in. 
At a later stage in the project this may not be known 
and appropriate author and title lines will be added. 

AT The next tour lines in the sheet allows the user to 

specify where the books are destined to be sent, the 
number of copies, whether they are going onto a special 
shelf, and whether multivolume sets are involved. 

CN This line is available for libraries who wish to have a 

local call number placed on their cards and labels 
instead of the Library of Congress call number. This 
allows those participating libraries who use Dewey 
Decimal call numbers to use the system. This does not 
affect the normal procedure of getting the Library of 
Congress call number on the bottom of the catalog cards. 
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; ”Xtra Cards” If th« library wishes to have extra main 

6ntry Tjards produced above the normal set, then by writing 
in the amount this will be accomplished. 

'LP ”No Cards, Labels, Pockets” This is a multi-use line in 

tTTat”thc library may suppress the production of catalog 
cards, spine labels, nd book pockets, or any combination 
of the three outputs. 

Cataloger Procedure for Filling Out the 
m'LlWT RequesT^Form 

Whene*/er a cataloger wishes to request catalog cards, 

>ok spine labels, and/or book pocket labels through the New 
igland Library Information Network Processing Center he will have 
> complete a NELINET Request Form. This form has been made as 
lort as possible and, we hope, as easy as possible to fill out. 
art of the data has been preprint? 1 on the forms to further 
acilitate fast and easy completion by the c°taloger. A separate 
>rm has been created for each participating library differing only 
r the library code name and the listing of abbreviations of shelf 
1st names and branch library names. 

The request forms are to be filled out in the following 
ay: 

ine Name 



0,MA,ME, 

Hi RI or 

1 *’ ’ This line is the library identifier. One of the previous 

state abbreviations will be preprinted on each of the 
request forms. No action needs to be taken by the 
cataloger as this will be transmitted by the teletype 
operator. 

C The cataloger must enter the Library of Congress card 

number on this line. The cataloger is to write this 
number in the form in which it normally appears on L.C. 

cards . 

Exampl e: 

64-4302 
HEW 66-2362 

L *X* This line, plus the next three lines, are provided for 

the cataloger to enter the location (copy is at....), 
copy-shelf statement end volume information. Some 
examples of how this information is to be filled out are 

given in 4-9. 
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CN 



xc 



NCLP 



If the cataloger wishes to have a c?ll number that is 
different from the Library of Congress call number then 
he may write this number on this line. This call number, 
referred to as a local call number, will then be placed 
in the left margin of all cards produced by this reques . 
The local call number will also be used on all book spine 
labels, and book pocket labels produced by this request. 
If this line is not used, then the Library of Congress 
call number that appears on the bottom of the L.C. card 
will be automatically placed in the left margin of the 
catalog cards, and all labels produced by this request 
will contain the L.C. call number. 

Refer to . 2 IZ for the proper wry to write this 
number on the cards . 

If the cataloger wishes to have more than the 
amount of main entry cards (Xtra Cards) producedbythe 
system for this request, the cataToger will write the 
extra amount he wishes in this space. This amount of 
extra main entry cards will be produced for this 
request in addition to the normal output. 

This line will be used by the cataloger if he wishes *° 
prevent the output of catalog cards, book spine labels, 
and/or book pocket labels, (No Cards, Labels, Pockets)^ 
The cataloger must circle the CTif car<Js are to be up- 
pressed, the L if book spine labels are to be suppressed 
and/or the P Tf book pocket labels are to be suppressed. 
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This appendix will explain the proper way to fill out 
the location - copy shelf location, and volume statement. This 
line, which we will refer to as the technical process statement, 
or TPS, is used to produce the proper number oi labels and select 
the correct profile for catalog cards. The TPS is made up of three 
separate parts: 



1. Location This may be the branch abbreviation 

or the main library. 

2. Copy - shelf 

statement. The copy information and shelf 

location information is inserted 
in this section. 



3 . Volume informa- 
tion If the book consists of more than 

one volume they are noted in this 
section. 



The parts are filled out in the following fashion: 

1. Location The cataloger is to insert in this 

space the branch abbreviation or 
the word ’’main” . 



Example 1 : 

A request is made for the central 
library. 

Solution : 

Write m £ i^ n in the space marked 
’’location” . 

Ex ample 2 ; 

A request is made for a branch 
library. 

Solution : 

Write the branch abbreviation in 
the space marked ’’location”. 

Example 3 : 

A request is made for both a branch 
location and the central library. 



o 
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Solution : 

Write ra a i n on the first line, 
and the TJranch abbreviation on the 
second line. 

A semicolon must be placed between location and copy 
information. This indicates the end of the location section and 
the beginning of the copy - shelf statement section. 

2. Copy-shelf 

statement The cataloger is to insert in this 

space the copy information and the 
shelf abbreviation if applicable. 
The following examples are given 
as guides . 

Example 1 : 

The cataloger requests cards for a 
single book and it is copy 1. 

Solution : 

The cataloger will write c.l in 
the space provided. 

Example 2 : 

The cataloger requests cards (and/ 
or labels) for three books, copies 
1, 2, and 3. 

Solution : write jc. 1-3 

Meaning : 

The " |” means to enumerate the 
information following the bar. 

The information supplied in this 
case meant the span covered was 
c.l, c.2, and c.3. 

Example 3 : 

The cataloger requests labels for 
copies 1-4 of a book and copy 4 is 
to go to the reference section. 

Solution : writeJc.1-3, c.4 Ref 




F - 6 



The bar ( ) means to enumerate the 
following information (up to a 
co mm a) if more than one copy— shelf 
statement item is listed. The 
comma separates subparts of the 
copy- shelf statement c.4 Ref means 
the labels produced for copy ' 
will have Ref printed on them. 

Example 4 : 

The cataloger request labels for 
5 books, 2 to go to reference, 1 
to the central library and 2 to the 
Blatz collection. 

Solution : 

c.l, | c. 2-3 Ref, j c.4-5 Elatz 

If volume information is involved then a semicolon will 
be used to separate copy information from volume information. 

3. Volume informa- 
tion The cataloger is to insert volume 

information in this space if it is 
present. The following examples 
are given as guides. 

Example 1 : 

A single volume is involved (as- 
suming this is the first of several 
to be issued in a set) . 

Solution: 



v . 1 

Example 2 : 

Two volumes, volumes 1 and 2 are 
involved. 

Solution : 

| v.U2 

As in the copy-shelf statement the 
bar ( | ) means to enumerate the 
information following, up to a 
comma, if one is present. 
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Example 3 : 

/ 

Two volumes are involved volumes 
2 and 4. 



Solution : 
v.2, v.4 

The comma separates subrerVionE. 
of the volume information part of 
the Technical Processing Statement. 

Example 4 : 

Four volumes are involved volumes 
1, 2, 3, and 5. 

Solution: 



|v.l-5, v.5 

The bar signifies that the informa- 
tion following is to be enumerated. 
The comma ends the part to be 
enumerated. 



Assume that the library wished to request processing cards 
for a single book. The statement will be the following: 

AT Q MAIN 

Note: c. 1 does not have to be inserted if only 1 copy 

is present and no volume numbers are involved. 

Assume the library wishes to request processing for 
the* second copy of the above book at a later time. 



AT q main; c.2 

Assume tho library wiohes to request proocooing for two 
copies of a book 1 and 2 . 

AT Q main; I c. 1-2 

for five copies, 1, 2, 3, 4, and 5 
ATq main; | c.1-5 

The present copy span can be up to 99 copies. 



o 




n 



T? _ 

a. 

Assume the library wishes to request processing for four 
copies , a replacement for copy one, which was destroyed and copies 
four, five, and six. 

AT □main; c.l, |c.4-6 

If the copy one above was going to a shelf location, say 
reference, then the AT statement would look like this: 

AT □ main; c.l Ref, |c.4-6 

Assume copy one was for the central library, copy two to 
four for the Ref collection, and copy five for the xyz shelf col- 
lection. 



AT q main; c.l, J c . 2-4 Ref t c . 5 xyz 

Assume that on the above order two volumes v.l and v.2 
are involved. 

ATQ main; c.l, | c.2-4 Ref ; c.5 xyz; | v.1-2 
Assume that only volume one is going to the xyz shelf 

1 ocation. 

AT q main; c.l, | c.2-4 Ref# | v.1-2 

and 



AT q main; c.5 xyz; v.l 
Assume copy 1 to main library 

copy 2 to mathematics branch 
copy 3 to reference collection. 

ATq main; c.l, c.3 Ref 

and 

ATq MATH; c.2 

Probably the most common request will be a single copy tor a cHneio 
location which will look like: 

AT □main 

The following is an explanation of each character in an 
example s+.atoinoiit . 
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Example : 

ATq main; je. 1-3, c.4 Ref; |v. 1-3 

AT n = the prefix of the statement which allows the 

' program to interpret the statement as a technical 

^rocvsssing statement. 

main * the location for which the request is made. Up 
*— 1 — 1 — to six characters may be entered in this space. 

If any entry is made other than the word 'main 
the line will then be considered information for 
a Branch library and this location will appear 
on all output . 

. = the semicolon separates the location information 

“ from the copy— shelf information. 

| „ the bar indicates that information following up 

to a comma or semicolon must be enumerated. 



c# l_3 s the spread of copies to be enumerated. 

, » separates the sub-parts of the statement. 

c 4 Ref * the copy number and its shelf location. This 

— automatically will be placed on all the catalog 

cards on labels produced for this copy number, 
(or numbers if it is to be enumerated) . 

; = separates the copy-shelf statement from the 

~ volume information. 

I - enumerate the following volume information up to 

a comma or end of the statement. 

v# l_3 8 the spread of volumes to be enumerated. 
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A local call number or CN field on the request sheet is 
available for the libraries who wish to have a call number ,® th ®„ d 

than the Library of Congress call number on the ige lib rrries 
the margin of catalog cards. This provision allows those J; ib ™ rl 
who are using the Dewey Decimal Call Hunger system or their ow 
unique call numbering system to use the NELINET. 

Certain arbitrary conventions are used to indicate upper 

case characters, but these conventions will only w Ere 

transmittal clerk rather than the catalogers. ^thf^filld^nd 
examples of formatting for local call numbers in this field and 

how the number will appear on the output medium. 



Example 1: 



574/.S2074 



Will look like: 



547 

.92074 



may appear on a single line. 



Example 2 : 

QP/4C1/.H4S13 

Will look like: QP 

481 

.H4813 



Example 3 : 

QL/473/ . M25/1964 

Will look like: QL 

478 

. M2 5 
1964 
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Sample Request Sheets 

The following is a number of sample request sheets that 
have been filled in with request data. 

In Figure 1, the library is requesting cards, spine labels 
and pocket labels for a single copy for the main library, ^tie 
Library of Congress call number is filled in and under the Technical 
Processing Statement the word main has been entered. This is the 
minimum amount of information TTTaF can presently be entered to 
produce card sets, spine labels, and book pocket labels. 

Figure 2 shows a request for cards and book pocket labels 
for 2 copies, one of which will be in the reference collection. 

The library has also requested one extra main entry card. Note 
that the L on the last line has been circled. This is done to 
indicate suppression of spine labels. 

Figure 3 is an example of a request for catalog cards and 
spine labels for three copies, copies one and two are for the mam 

library and copy three is for the Rhode Is * a " d _°" he local 

location, (c.3 R.I. CL) The library has also filled in the local 

call number line (CN D ) . This number will appear on a J* ca * a £| t 

cards and spine labels for this order. The P is circled to prevent 

pocket label production. 

Figure 4 shows a request for catalog cards, book spine 
labels, and book pocket labels for three books, v.l c.l to the 
central library card v.l c.2, v.2 c.2 to the NML branch library. 

Two groups of catalog cards will be produced for this request: 
group one will use the general library formats and ‘"‘J "J* 1 

use the branch library formula. The spine labels a " d ^ “ a Ff in 
area of the catalog cards for the branch (NML) request will b 
headed with the branch abbreviation in this case NML. 

Figure 5 is a sample request for catalog cards and book 
spine labels for eight items consisting of copiesi-Sofvoiijme 
2 and 3 for the central library, and copies 1-2 of 
branch library called EXT. The library is also requesting that the 

local call number be used in margin of the card rath ®J\“* a " *j® 
normally supplied L.C. call number. Two extra main entry cards 
have been requested and book pocket labels are not to be prod 



The following is an image of what each spine label 
would look like from thdo request (Figure 5) . 



824 

.8 

1964 

v.2 

c.l 



824 

.8 

1964 

v.2 

c.2 



824 

.3 

1S64 

v.2 

c.3 



824 

.3 

1964 

v.3 

c.l 



324 

O 

is64 

v.3 

c.2 



824 

.8 

1964 

v.3 

c.3 



EXT 


EXT 


824 


824 


.8 


.8 


v.l 


v.l 


c.l 


c.2 




* ^ - ~f^U#*****l 
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NELINET Request Form 



RI 



LC D 
ATq 

at d 

at d 

at d 

CNfl 



63 - n c fS3 




L.C. Card Ni 

M A/A/ 


imber 




Location 


Copy-Shelf Statement 


Volume 















Local Call Number 



XC 



^ Extra Ham Entry Cards 



N CLP 

No Cards Labels Pockets 



Branches 



NML 

GLS 

EXT 



Shelf Locations 

Rare 

R.I.C1 

J.F.K. 

Blatz 

Ref 

Juv. 

Archiv 
Thesis 
mfilm 
mcard 
mf iche 



FIGURE 1 



i o 

ERIC 
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NELINET Request Form 



RI 



lc n £3 - / 7Y 6 3 

L.C. Card Number 



AT Q 

AT D 

AT q 

at d 

cn d 



MPtl A/ 


: C. 1,0. A, A 't-f 1 




Location 


> ' Copy-Shelf statement 


Volume 















Local Call dumber 



XC 



D 



A 

X 



Extra Main fcntry Cards 



Branchee 

NML 

GLS 

EXT 



N C (l)p 

No Cards Labels Sockets 



Shelf Location s 

Rare 

R.I.C1 

J.F.K. 

Blatz 

Ref 

Juv. 

Archiv 
Thesis 
mf ilm 
mcard 
mf iche 



I 



FIGURE 2 



o 

ERIC 
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NELINET Request Form 






□ 



□ 



□ 



□ 



63 -£36J 7 

L.C. Card Number 



M A IN 


lc.l-, 1 , C.3 R.X. CL , _j 




Location J 


Copy-Shelf Statement 


Volume 















•! n 51 4 A *1X01 + 

u XTocal call Number 



2 r*i ^ 

w Extra Main Entry Cards 



CL \S) 

No Cards Labels pocnets 



Branches 

NML 

GLS 

EXT 



Shelf Locations 

Rare 
R. I .Cl 
J.F.K. 

Blatz 

Ref 

Juv. 

Archiv 
Thesis 
mfilm 
mcard 
mf iche 



o 

ERIC 



FIGURE 3 
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NELINET Request Form 



RI 



L.C. Card Number 

AT- ' M f\> V 



V. X. 



-h 



Copy- She 11 Statement 7 volume 

I / . / - 2 ^ 



ATr 



Location 



C - 



A/ A) L 



C. A 



AT, 



□ 



AT— j 



CN; 



□ 



xcn 



Local Call Number 



Extra Main Entry cards 



N CLP 

No Cards Labels Focliets 



Branches 

NML 

GLS 

EXT 



Shelf Location s 

Rare 

R.I.C1 

J.F.K. 

Blatz 

Ref 

Juv. 

Archiv 
Thesis 
mf ilm 
mcard 
mf iche 




FIGURE 4 
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NELINET Request Form 



RI 



LC -, (d± -j±^XL 

L ' r.C. dard Number 



at d 

AT 

□ 

AT d 

at d 



M * i V 


t 1 C • L- 3 


4 

J— 


1 V*£-3 


Location 

zxr 


/ Copy-Shelf Statement 




Volume 

V. J- 



















CN tfc 'J-i'/ . $ / £ 

□ Local Call Number 

XC r - C ^ 

L - Extra Main Entry Lards 

N C L (® _ 

No Cards Labels Pockets 



Branches 



NML 

GLS 

EXT 



Shelf Locations 



Rare 

R.I.C1 

J.F«K, 

Blatz 

Ref 

Juv. 

Archiv 

Thesis 

mf ilm 

mcard 

mf ich^ 




FIGURE 5 



APPENDIX G 



COMPRESSION WORD CODING TECHNIQUES: 
TRANSITION DISTANCE CODING, ALPHACHECK 
RECURSIVE DECOMPOSITION, AND SOUNDEX. 



"I have said enough to convince you that 
ciphers of this nature are readily soluble, 
and to give you some insight into the ra- 
tionale of the development. But be assured 
that the specimen before us appertains to 
the very simplest species of cryptograph.” 

—Edgar Allen Poe 
The Gold Bug 
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Gl. INTRODUCTION 

Cryptographic studies have documented much useful 
language data having application to retrieval coding. Because 
unclassified cryptographic studies are few, Fletcher Pratt’s 1939 
work* remains the classic in its field. Gaines has the virtue 
of b§ing in print, and the more recent cryptographic history of 
Kahn", while comprehensive, lacks the statistical data that made 
the earlier works valuable. The work coding problem for language 
processing, as opposed to cryptography, has been extensively studied 
by Nugent and Vegh^. Information theorists have contributed the 
greatest volume of literature on coding, and have added to its 
mathematical basis, largely from the standpoint of communications 
and error avoidance. 

We present here a brief discussion of compression codes 
and their objectives, and then describe four compression codes hav- 
ing application to retrieval directories. 

Transition Distance Coding is a new method that has been 
devised for this project. It is a randomizing code that results 
in short codes of high resolving power. 

Alphacheck has also been devised for this project, and 
combines high readability with good resolution. It permits simple 
truncation to be used by means of applying a randomized check 
character that acts as a surrogate of the omitted portion. It 
appears to have the greatest potential, in directory applications, 
of the codes considered here. 

Recursive Decomposition is a selected letter code that 
was devised by the author several years ago • It has been tested 
and has the advantages of simple derivation and high resolution. 

Soundex 5 is the only compression code that has achieved 
wide usage. It was devised at Remington Rand for name matching 
under conditions of uncertain spelling. 

G2. OBJECTIVES OF COMPRESSION CODING 

It is desired to transform sets of variable length words 
into fixed length codes that will maximally preserve word to word 
discrimination. In the final directories to be used, the codes for 
several elements will be accessible to enable the matching of 
several factors before a file record is selected. The separate 
codes for differing factors need not be the same length, though 
each type of code will be of uniform length, nor need the codes 
for differing factors be derived by the same process. 

What we loosely call codes, must be formally ciphers. 
That is, they must be derivable from the data words themselves, and 
not require "code books" to determine equivalences. This is so 
because the file directories must be derivable from file items, 
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entries in directory form must be derivable from an input query, 
and these two directory items must match when a record is to be 
extracted. The ciphers need not be deciperable for out application, 
and in general are not. 

Fixed length is generally desirable for machine directories 
since this provides the rough equivalent and simplicity of a margin 
entry in a paper directory. We will examine the question of variable 
length directories and directory list structures in a later memo. 

The functions of the codes will determine their form, 
and a code or file key designed to meet one objective will generally 
not be satisfactory for any other objective. We will clarify this 
point by illustrating some typical objectives: 



(a) File ';ey for extraction of records in ap- 
proximate file order. (Sorting and Print- 
out Problem.) 

Typical code construction rule: Take first 

3 ix letters. 



JOHNSEN JOHNSE 
JOHNSON =-$> JOHNSO 
JOHNSTON JOHNST 
JOHNSTONE JOHNST 

(b) File key for extraction of records under 

conditions of uncertainty of spelling (Air- 
line reservation problem.) 

Typical code construction rule: Vov/el Elimination 
or Soundex. 



Typical matching rule: Best match. 



Vowel Elimination 



Soundex 



JOHNSEN =*> JHNSN 
JOHNSON =-s> JHNSN 
JOHNSTON —=> JHNSTN 
JOHNSTONE — > JHNSTN 



J525 — £> J52 
J525 — ■ > J52 
J5235 *-> J52 
J5235 ««> J52 



(c) File key extraction of records from accurate 

input, with objective of maximum discrimination 
of similar entries (cataloging search problem.) 



Typical code construction rules: Recursive 
Decomposition Coding or Transition Distance 
Coding. 




Recursive Decomposition Transition Distance 



JOHNSEN ==* JHNSEN 
JOHNSON — JHNSON 
JOHNSTON JHSTON 



BFTZ 

DNWU 

ZIKY 

ECRC 



JOHNSTONE »» JHSONE 



In file keys we are concerned with presently, we 
assune accurate input data and the objective is 
maximum discrimination. Since it would be nice 
to have our cake and eat it too, we would like a 
code to be as discriminating as Transition Distance 
Coding and to be as readable as truncation coding. 
We achieve this, possibly, by combining the two 
codes into one, with an initial portion truncated 
and a final check character representing the 
remainder via a compressed Transition Distance 
Code : Alphacheck . 

(d) File key for human readability and high word 
to word discrimination. 

Possible code construction rules: Alphacheck: 

Simple truncation plus a terminal check character. 



JOHNSON ==> JOHNSX 

JOHNSTON «£> JOHNSD 

JOHNSTONE «£> J0;INS3 

We describe these procedures in the following 
sections . 

G3. TRANSITION DISTANCE CODING 
It is axiomatic that randomizing codes give the greatest 



possible discrimination ^or a given code space. The whole trick 
of creating a good compression code is to eliminate the natural 
redundancy of English orthography, and preserve discrimination in 
a smaller word size. 



Letter-selection codes can only half accomplish this, 



due to the skewed distribution cf letter usage. They can eliminate 
the higher frequency components, but they cannot increase the use 
of the lower frequency components. 

Randomizing codes - often called "hash" codes, properly 
quasi-random codes - can equalize letter usage and hence make best 



JOHNSEN ==!> 



JOHNS V 
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use of the code space. Prime examples here are the variants of 
Go'del coding devised by Vegh 4 in which the principle of obtaining 
uniqueness via the products of unrepeated primes is exploited, as 
it is in the randomizing codes we consider here. The problem in 
design of a randomizing code, is that the results can be skewed 
rather than uniformly distributed due to the skewed nature of the 
letters and letter sequences that the codes operate on. 

In Transition Distance Coding, we overcome the natural 
bias of letters and letter sequences, by operating on a word 
parameter that is itself semi random in nature. We advance the 
following principle, not quite a theorem. 

Principle: Considering letters in their 

normal ordinal alphabetic position, and 
considering letter transitions to be 
unidirectional and cyclic, the distribu- 
tion of transition distances in English 
words is essentially uniform. 

In view of the fact that letter usage has an extremely 
shewed distribution, with a probability ratio in excess of 170 to 1 
fcr the extremes, it is seen that the more uniform parameter of 
transition distances is a superior one for achieving randomized 
codes. The relative uniformity of transition distance needs further 
investigation, but one typical letter diagram sample from Gaines^ 
with 9999 transitions (means number of occurrences of each distance = 
335) yielded a mean deviation of 99 and a standard deviation of 
123, and an extreme probability ratio of 3.3 to 1 for the different 
transition distances from 0 to 25. The distribution can be made 
more uniform by letter permutation. Permutation is used in the 
algorithm for Transition Distance Coding but not in Alphacheck. 

It's value will be determined. 

G3.1 ALGORITHM 

The method of Transition Distance Coding is used to 
operate on a variable length word to achieve fixed length alphabetic 
or alphanumeric codes that exhibit quasi-random properties. The 
code is formed from the modulo product of primes associated with 
transition distances of permuted letters. The method is intended 
strictly for computer operation, as it is a simple program but an 
extremely tedious manual operation. There are five steps: 

(1) Permute characters of natural language word. 

This breaks the diagram dependency that could 
make the transition distances less uniformly 
distributed. This step might be dispensed with 
if the resulting distributions prove satisfactory 
without it. The permutation process consists of 
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tailing the middle letter (or letter right of 
middle for words with an even number of let- 
ters), the first, the last, the second, the 
next-to-last , etc. until all letters have 
been used. That is, for a letter sequence: 



a 



1' 




• • • 




, a 



n 



The following permutation is taken: 



a _ w n,l a , 

Int(2+ ) t i 



a » 

n 



a 



a 



2 9 n-1 



9 • • * a . » 

(l+i) 



2i f • • • 

(n-i) 



a n + , , 



where Int and Rem refer to the integer part and 
remainder, respectively. To illustrate a typical 

case: 



JOBNSEN — i> N JKOEHS 

(2) Take transition distances of the characters. 

We assign letters a position value corresponding 
to their normal ordinal alphabetic positions 
excepting Z, which we equate to 0, (e.g., A - 1, 

Y * 25, Z - 0), and take the transition distances 
between successive letters of the input sequence. 
Distance is measured unidirectionally in alphabetic 
order, and cyclicly (i.e., "around the bend,” Z to 
A.) The sequence AX has the transition distance 
Nv-Na«24-1=23. Negative distances are converted 
to their positive cyclic distance ty taking their 
26 complement. That is, the sequence XB has the 
transition distance N 5 -N.,=2-24=-22=»=> 26-22-4. To 
follow the * JOHNSEN* example: 

NJNOEHS =4 (14,10,14,15,5,8,19) (22,4,1,16,3,11) 

letter distances 

numbers 



(3) Associate with each transition distance, a cor- 
responding prime number • Table G1 shows the 
primes corresponding to the transition distances. 
We begin with 5 so that the alphanumeric base (36) 
and all numbers are relatively prime. We follow 
the example above: 

(22,4,1,11,3,11) (29,13,5,61,11,41) 

distances primes 




NkJX{sS<Sc3HW ta/o *d05Sgt H Wt4H«0'*jWdOlJJ> 



LETTER 



LETTER POSITION AND 
DISTANCE VALUE 



PRIME NUMBER 



1 


5 


2 


7 


3 


11 


4 


13 


5 


17 


6 


19 


7 


23 


8 


29 


9 


31 


10 


37 


11 


41 


12 


43 


13 


47 


14 


53 


15 


59 


16 


61 


17 


67 


18 


71 


19 


73 


20 


79 


21 


83 


22 


89 


23 


97 


24 


101 


25 


103 


0 


107 



TABLE G1 

Letter Positions and Primes used in 
Transition Distance Coding and Alphacheck 



G - C 



(4) Multiply these primes , modulo the capacity of 
the computer. Integer multiplication in single 
precision is effected, disregarding overflow. 

For a computer with an 1G bit word length 
containing a 1 bit sign position, we multiply 
modulo 2 1 '. That is, we disregard product 
portions that equal or exceed 131,072, For 

a machine of this type, then, we will be 
generating a quasi random number in the range 
of 0 to 131, 071. This is converted to alpha- 
numeric form in the next step. Following the 
example : 

89 x 13 :: 5 x 61 - (352,885) Mod2 17 - 90,741 

90,741 x 11 - (998,151) Mod2 17 - 80,647 

17 

80,647 x 41 - (3,306,527) Mod2 - 39,727 

(5) Convert to alphabetic or alphanumeric form. 

We now express the number derived above as an 
integer base 26 (alphabetic form) or base 36 
(alphanumeric form) , We will use a 4 digit 
code. In the case of alphabetic representation 
we use the letters to represent the numbers of 
their ordinal position (A*l, B«*2, etc.), and 
use Z as zero. In alphanumeric form we would 
use the digits 0 to 9 to represent this range, 
and the letters A through Z would represent 
the range from 10 to 35. 

Using the 18 bit word length we have assumed, 
tie alphabetic form is as good as the alpha- 
numeric. The range of the random number extends 
to 131,071; the range of 4 digit alphabetic 
representation extends to (26^-1) * 256,975; the 
range of 4 digit alphanumeric representation 
extends to (36^-1) = 1,676,615. Hence, the 
alphabetic representation is sufficient. Y7g 2 
divide the random number successively by 26°, 26,, 
26*, and 26° to obtain the alphabetic form. We 
follow the example: 

39.727/2S 3 - 2 + Rem 4575 B 

4575/26 2 - 6 + Rem 520 =*> F 

520/26* - 20 + Rem 0 T 

0/26° - 0 -S> Z 




JOHNSEN 



BFTZ 
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G4. ALPHACHECK 

Alphacheck is a means for creating a randomized alpha- 
numeric check digit. 17 hen used with a selected letter compression 
code, it operates on the missing letters to generate a single 
character surrogate. We use it to add discrimination to a simple 
truncation code, and thereby we hope to attain a compression code 
that is both readable and resolving. 

A process practically identical to that of Transition 
Distance Coding is used, except that at the final step the random 
number is taken modulo 3S and expressed as an alphanumeric character . 
The ten numeric digits represent themselves, and the letters A to 
Z represent the mod 3G numbers from 10 to 35, or their ordinal 
alphabetic value plus 9 . 

In this case, the difference between an alphabetic 
representation and an alphanumeric one is significant, since only 
one character is used, and the range of the Alphacheck character 
is much smaller than the range of the binary random number it is 
derived from. 

The probability of no repetition of Alphacheck codes 
in a sample of size r, is a case of determining the probability of 
uniqueness for sampling with replacement from a population n, for 
which: 

n! 

p = - 

n r (n-r)! 

where n is the range of the code, for alphanumeric Alphacheck, 
n = 36. 



The median of the distribution of p, r^ gives the sample 
size for which the probability of uniqueness is u.5. This is 
estimated by taking the logarithmic form of p, which yields a good 
approximation when n is large with respect to r. 

2 

In pta - _£ 




2n 



1/2 



[2n ln( .5) ] ** 



1/2 

1.18 n » 



7.08 



By comparison, r for n=26 is 6.05; for n-131.072 (Transition 
Distance Coding ?n 4 characters and modulo 2 i7 ) r ffi is 427. 
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We i.iay conclude that the alphanumeric Alphacheck (36 
symbol) has a 50% expectation of uniquely resolving 7 otherwise 
identical 5-letter truncations of source words; this is a one 
word advantage over the 26 symbol alphabetic Alphacheck. Hence, 
we will use the alphanumeric form. 

G4.1 ALGORITHM 

It is not appropriate to use the identical randomizing 
method of T.D.C, (Transition Distance Coding), since this was 
designed to operate on full words, whereas we wish to operate on 
the omitted remainders of truncated words, which are often as short 
as two letters. When a two letter remainder exists only one 
transition distance is involved, and hence only one prime number; 
and the individual primes are not uniformly distributed modulo 36. 
Hence, in the case where only one transition distance exists, the 
corresponding prime is multiplied by two additional primes cor- 
responding to the letters involved (Table Gl.) If only two 
distances are involved, we associate another prime corresponding 
to the last letter. Since randomization is created largely by 
the multiplicative properties of the process, we insure that at 
least three factors are multiplied in all cases. Except for this 
difference in step three, the randomizing process is essentially 
identical to that of TDC. The steps are: 

(1) If word is six letters or less take whole 
word, otherwise, take first five letters 
and compute an Alphacheck character for 
the sixth, based on the omitted letters. 

(2) Take transition distances of the omitted 
letters (as in TDC). 

(3) Associate with each transition distance a 
corresponding prime number (as in TDC), If 

only one transition distance exists, additionally 
associate prime numbers with the remaining 
letters. If only two transition distances 
exist, additionally associate a prime number 
with the last letter. 

(4) Multiply these primes, modulo the capacity 
of the computer (as in TDC). 

(5) Convert to alphanumeric form in 1 symbol, 

modulo 36, in which 0 ==>> 1, ... , 9 9, 

10 => A, 11 =5> B, ... , 35 ==> Z. 

The example of the JOHNS - names, shown in Table G2 

illustrates the process. 




Name 

Truncated 

Portion 

Remainder 

Letter # 

Distance 

Distance 

Primes 

Letter 

Primes 

Product 

Mod 2 17 

Mod 36 

Alphacheck 

Character 

Resulting 

Code 



3 

ERIC 
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JOHNSEN 


JOHNSON 


JOHNSTON 


JOHNSTONE 


JOHNS 


JOHNS 


JOHNS 


JOHNS 


EN 


ON 


TON 


TONE 


5,14 


15,14 


20,15,14 


20,15,14,5 


9 


25 


21,25 


21,25,17 


31 


103 


03 , 103 


33,103,67 


17,53 


59,53 


53 


- 


27,931 


322,081 


453,09? 


572,783 


27,931 


59,937 


59,331 


43,495 


31 


33 


13 


3 


V 


X 


D 


3 


JOHNSV 


JOHNSX 


JOHNSD 


JOHNS3 



TABLE G2 

Example of Key Generation by Alphacheck 
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G5. RECURSIVE DECOMPOSITION CODING 

This method uses a frequency ordering of letters, and 
selection or rejection of a particular letter is based on that 
letters relative order in the table with respect to the previous 
letter. It thus gives a statistical advantage, though not an 
absolute one. to the lower frequency letters. Since many words 
differ only in high frequency vowels (e.g., COMPUTE, COMPETE, 

COMPOTE) , this relative order feature adds a randomizing aspect 
to selection that permits inclusion of occasional high frequency 
letters. 

, The frequency ordering used is taken from tables in 
Pratt 1 . Different word samples will yield slightly different 
orderings, but the cipher resolution is not sensitive to minor 
orderings. The Pratt ordering ie: 

ETAONRI SHDLFCMUGYPWBVKXJQZ 
The algorithm is: 

"If a source word is longer than six letters, select 
the first letter and subsequent letters of lesser or equal order- 
ing than the prior letter, and continue the process recursively 
until six letters remain. Words of six letters or less are 
reproduced in full and filled out with null symbols, where neces- 
sary, until a total of six characters is reached.’’^ 

Several examples will illustrate the system. Omitted 
letters are shown circled, and successive cycles are shown by 
arrows . 

1. B®bQ)G0P(|ER^BBOPER 

2. I (g)F ©R M O) I ©N=5»I F R M I N 

3. S H®K(f)S P(g)A R©J>S H K©P A R^>S H K P A R 

4. SKIT H=>S M I T H □ 

5. K0G®F@RD0H0M©FRDM^KFRD1IH 

6. K(§) I C H ^ M®0 R®hQ&>K(DS H M@R H^K S H M R H 

In some very rare cases, an emerging cipher may have more 
than six letters in descending sequence, so that it will not de- 
compose further. In such cases the final letters are eliminated 
until six remain. 
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most woras, nowever, will reduce in one or two cycles. 

In a test of 55,000 words only one was found requiring four cycles. 



Most words, however, will reduce in one or two cycle; 



A few extreme cases do exist, however: the longest ever found 

required six cycles: 



Only slightly shorter, the longest word in Shakespeare* s works 
(Love*s Labours Lost V.i) reduces in three recursions: 



Even Mary Poppins' sesquipedalian ecptpnesis crumbles to 
six letters in three recursions: 



simplicity and its resolution. The elimination requires only table 
lookup and no multiplications; and the compression is readily done 
manually. The resolution is apparently as good as one can get 
with a selected letter compression code. It effectively flattens 
the high portions of the letter frequency curve, though unlike 
a randomizing code, it cannot totally equalize the distribution. 

The resolution, however, is quite good. Specifically in a test 
o * 1:362 words (chosen from the secret? ry*s handbook 20,000 Words ), 
only thirty of the six letter ciphers (about 0.61%) were non~u:::.cnie 



7. 




A N ©I D(|) S <DS ©A B lA C H M(©N©A R I©N I S M=> 
A N I D © S © B(s) H M R I (g) I S M=> 

A N I D©B®M(g)I I S M=? 




A N I 



D B0S M 



A N I 




A N I D B M 



8. II@N@R I 



H®R I F C 
H(g)I F C B(0 
H I F C B B 



(uj) B=> 





d@C(6)u(§)^?> 

s 

s 

S U P G X U 

The prime advantages of the method are its computational 







ERIC 
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and of the non-unique ciphers all were simple pairs except for one 
instance of three occurrences* The method compresses quickly: 
since all non-initial letters have a *5 probability of being 
retained, the expected length, 1, of an n letter word aftor r 
recursions is: 

L - i + n-1 

2 r 

This indicates that a 43 letter word may be expected to compress 
to six letters in three recursions* 

G6. THE SOUNDEX CODE 

The Soundex code**, though widely used, is of obscure 
origin and has been attributed to Remington Rand* It is a phonetic 
code that tends to create identical codes from similar sounding 
names* If is useful for name searching under conditions of un- 
certainty of spelling, such as occurs in the airline reservation 
problem where it is often required to match a telephoned name in 
a machine file* The code has five steps: 

1* Retain first letter of anme as first letter 
of code. 

2. Eliminate vowels, W, H, and Y* 

3. Eliminate the second consonant of a double 
consonant pair. 

4. Replace the following letters by numbers: 



B, P,F,V 1 

C, G,J,K,Q,S,X,Z,SC,CH,SCH,CK 2 

D, T 3 

L 4 

M,N 5 

R 6 



5* Take the first three or four symbols, and add 
zeros if insufficient phonetic sounds. 

The example below illustrates the process: 

JOHNSEN => JNSN => J525-=> J52 
JOHNSON =S> JNSN => J525~> J52 
JOHNSTON => JNSTN=> J5235=$> J52 
JOHNSTONE =£>JNSTN=> J5235=£> J52 
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Business policy: a management audit 
MANAGEMENT AUDIT- -CASE STUDiES. 



HD 

38 
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Business policy: a management audit 
Business pol I cy 



HD 

38 r 

.6725 
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of book) 



LB 
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KF 

9625 

• T5 

LB 
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• T5 6 
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B 

825 
• P57 
1967 

HD 

5764 

,A6 

K5 

PQ 

4835 

.17 

Z539 
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38 Business policy; a management audit 
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1. Management audlt--Case studies. 

I. Title. 



-N 



o 

ERIC 



o 



67-14417 

658 



HD Greenwood, William T., comp. 

38 Business policy; a management audit 

. G7 25 approach, by William T. Greenwood. 

New York, Macmillan [1967] 

xlv, 656 p. illus., maps. 24 cm. 

Includes bibliographical references. 
1. Management audit--Case studies. 

I. Title. 



n.h. 

HD38.G725 



o 



67-14417 

658 



This library requires 
fcur main entries. 



